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0 Nanbv diagnostics and vaccines. 



@ A family of cDNA sequences derived from hepatitis C virus (HCV) are provided. These sequences encode 
^antigens v/hich react immunologically with antibodies present in individuals with non-A non-B hepatitis (NANBH), 
but yyhich generally are absent from individuals infected with hepatitis A virus (HAV) or hepatitis B virus (HBV), 
and also are absent from control individuals. A comparison of these cDNA sequences with the sequences in 
Genebank. and with the sequences of hepatitis delta virus (HDV) and HBV shows a lack of substantial homology. 

<^A comparison of the sequences of amino acids encoded in the cDNA with the sequences of Flavivlruses 
indicates that HCV is a Ravivirus or RavHike virus. 

CD 

The HCV cDNA sequences are useful for the design of polynucleotide probes, and for the synthesis of 
*^ polypeptides which may be used in immunoassays. Both the polynucleotide probes and the polypeptides may 
00 be useful for the diagnosis of HCV-induced NANBH. and for screening blood bank specimens and donors for 
^ HCV infection. In addition, tiiese cDNA sequences may be useful for the synthesis of Immunogenic polypeptides 

which may be used in vaccines for tiie treatment, prophylactic and/or tiier^eutic. of HCV infection. Polypeptides 
©encoded within the cDNA sequences may also be used to raise antibodies against HCV antigens, and for the 

purification of antibodies directed against HCV antigens. These antibodies may be useful in immunoassays for 
UJ detecting HCV antigens associated with NANBH in individuals, and in blood bank donations. Moreover, tiiese 

antibodies may be used for treatment of NANBH in individuals. 

The reagents provided in tiie invention also enable tiie isolation of NANBH agent(s), and the propagation of 
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tiiese agent(s) in tissue culture systems. Moreover, they provide reagents which are useful for screening for 
antiviral agents for HCV, particularly In tissue culture or animal model systems. 
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NANBV DIAGNOSTICS AND VACCINES 



Technical Reld 

The invention relates to materials and methodologies for managing the spread of non-A. non-B hepatitis 
5 virus (NANBV) infection. More specifically, it relates to diagnostic DNA fragments, diagnostic proteins, 
diagnostic antibodies and protective antigens and antibodies for an etiologic agent of NANB hepatitis, i.e.* 
hepatitis C virus. 
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Background Art 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease or family of diseases that are believed to be 
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viral-induced, and that are distinguishable from other forms of viral-associated liver diseases, including that 
caused by the known hepatitis viruses, i.e.. hepatitis A virus (HAV), hepatitis B virus (HBV), and delta 
hepatitis virus (HDV). as well as the hepatitis induced by cytomegalovirus (CMV) or Epstein-Ban^ virus 
(EBV). NANBH was first identified in transfused individuals. Transmission from man to chimpanzee and 

5 serial passage in chimpanzees provided evidence that NANBH is due to a transmissible infectious agent or 
agents. However, the transmissible agent responsible for NANBH is' still unidentified and the number of 
agents which are causative of the disease are unknown. . 

Epidemiologic evidence is suggestive that there may be three types of NANBH: the water-borne 
epidemic type; the blood or needle associated type: and the sporadically occunring (community acquired) 

70 type. However, the number of agents which may be the causative of NANBH are unknown. 

Clinical diagnosis and identification of NANBH has been accomplished primarily by exclusion of other 
viral markers. Among the methods used to detect putative NANBV antigens and antibodies are agar-gel 
diffusion, courrterimmunoelectrophoresis, immunofluorescence microscopy, immune electron microscopy, 
radioimmunoassay, and enzyme-linked immunosorbent assay. However, none of these assays has proved 

75 to be sufficiently sensitive, specific, and reproducible to be used as a diagnostic test for NANBH. 

Until now there has been neither clarity nor agreement as to the identity or specificity of the antigen 
antibody systems associated with agents of NANBH. This is due. at least in part, to the prior or co-infection 
of HBV with NANBV in individuals, and to the known complexity of the soluble and particulate antigens 
associated with HBV, as well as to the integration of HBV DNA into the genome of liver cells. In addition. 

20 there is the possibility that NANBH is caused by more than one infectious agent, as well as the possibility 
that NANBH has been misdiagnosed. Moreover, It Is unclear what the serological assays detect in the 
serum of patients with NANBH. It has been postulated that the agar-gel diffusion and counterimmunoelec- 
trophoresis assays detect autoimmune responses or non-specific protein interactions that sometimes occur 
between serum specimens, and that they do not represent specific NANBV antigen-antibody reactions. The 

25 immunofluorescence, and enzyme-linked immunosorbent and radioimmunoassays appear to detect low 
levels of a rheumatoid-factor-like material that is frequently present in the serum of patients with NANBH as 
well as in patients with other hepatic and nonhepatic diseases. Some of the reactivity detected may 
represent antibody to host-determined cytoplasmic antigens. 

There are a number of candidate NANBV. See. for example the reviews by Prince (1983). Feinstone 

30 and Hoofnagle (1984), and Overby (1985. 1986, 1987) and the article by Iwarson (1987). However, there is 
no proof tiiat any of these candidates represent the etiological agent of NANBH. 

The demand for sensitive, specific methods for screening and identifying carriers of NANBV and 
NANBV contaminated blood or blood products Is significant Post-transfusion hepatitis (PTH) occurs in 
approximately 10% of transfused patients, and NANBH accounts for up to 90% of these cases. The major 

35 problem in this disease is the frequent progression to chronic liver damage (25-55%). 

Patient care as well as the prevention of transmission of NANBH by blood and blood products or by 
close personal contact require reliable diagnostic and prognostic tools to detect nucleic acids, antigens and 
antibodies related to NANBV. In addition, there is also a need for effective vaccines and immunotherapeutic 
therapeutic agents for the prevention and/or treatment of the disease. 

40 

Disclosure of the Invention 

The invention pertains to the isolation and characterization of a newly discovered etiologic agent of 
45 NANBH, hepatitis C virus (HCV). More specifically, the invention provides a family of cDNA replicas of 
portions of HCV genome. These cDNA replicas were isolated by a technique which included a novel step of 
screening expression products from cDNA libraries created from a particulate agent in infected tissue with 
sera from patients with NANBH to detect newly synthesized antigens derived from the genome of the 
heretofore unisoialed and uncharacterized viral agent and of selecting clones which produced products 
so which reacted immunologically only with sera from infected individuals as compared to non-infected 
individuals. 

Studies of tiie nature of the genome of tiie HCV, utiliang probes derived from tiie HCV cDNA. as well 
as sequence infonnation contained within the HCV cDNA. are suggestive that HCV is a Ravivirus or a Ravl- 
like virus. 

55 Portions of tiie cDNA sequences derived from HCV are useful as probes to diagnose the presence of 
virus in samples, and to isolate naturally occurring variants of the virus. These cDNAs also make available 
polypeptide sequences of HCV antigens encoded within the HCV genome(s) and penmits tiie production of 
polypeptides which are useful as standards or reagents in diagnostic tests and/or as components of 
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vaccines. Antibodies, botii polyclonal and mohoclohal. directed against HCV epitopes contained within these 
polypeptide sequences are also useful for diagnostic tests, as therapeutic agents, for screening of antiviral 
agents, and for the isolation of the NANBV agent frorn which these cDNAs derive. In addition, by utilizing 
probes derived from these cDNAs it is possible to isolate and sequence other portions of the HCV genome, 
5 thus giving rise to additional probes and polypeptides which are useful in the diagnosis and/or treatment, 
both prophylactic and therapeutic, of NANBH. 

Accordingly with respect to polynucleotides, some aspects of tiie invention are: a purified HCV 
polynucleotide; a recombinant HCV polynucleotide; a recombinant polynucleotide comprising a sequence 
derived from an HCV genome or from HCV cDNA; a recombinant polynucleotide encoding an epitope of 
10 HCV; a recombinant vector containing the any of the above riecombinant polynucleotides, and a host cell 
transformed with any of these vectors. 

Other aspects of the invention are: a recomtwnant expression system comprising an open reading frame 
(ORF) of DMA derived from an HCV genome or from HCV cDNA, wherein the ORF Is operably linlced to a 
control sequence compatible with a desired host, a ceil transformed with the recombinant expression 
15 system, and a polypeptide produced by the transformed cell. 

Still other aspects of the invention are: purified HCV, a preparation of polypeptides from the purified 
HCV; a purified HCV polypeptide; a purified polypeptide comprising an epitope which is immunologically 
identifiable with an epitope contained in HCV. 

Included aspects of the invention are a recombinant HCV polypeptide; a recombinant polypeptide 
20 comjprised of a sequence derived from an HCV genome or from HCV cDNA; a recombinant polypeptide 
comprised of an HCV epitope; and a fusion polypeptide comprised of an HCV polypeptide. 

Also included in tiie invention are a monoclonal antibody directed against an HCV epitope; and a 
purified preparation of polyclonal antibodies directed against an HCV epitope. 

Another aspect of the invention is a particle which is immunogenic against HCV infection comprising a 
25 ndn-HCV polypeptide having an amino acid sequence capable of forming a particle when said sequence is 
produced in a eukaryotic host, and an HCV epitope. 

Still another aspect of the invention is a polynucleotide probe for HCV. 

Aspects of the invention which pertain to kits are those for: analyzing samples for the presence of 
polynucleotides derived from HCV comprising a polynucleotide probe containing a nucleotide sequence 
30 from HCV of about 8 or more nucleotides, in a suitable container; analyzing samples for the presence of an 
HCV antigen comprising an antibody directed against the HCV antigen to be detected, in a suitable 
container; analyzing samples for Vne presence of an antibodies directed against an HCV antigen comprising 
a polypeptide containing an HCV epitope present in the HCV antigen, in a suitable container. 

Oti^er aspects of tiie invention are: a polypeptide comprised of an HCV epitope, attached to a solid 
35 substrate; and an antibody to an HCV epitope, attached to a solid substrate. 

Still other aspects of the invention are: a method for producing a polypeptide containing an HCV 
epitope comprising incubating host cells transformed with an expression vector containing a sequence 
encoding a polypeptide containing an HCV epitope under conditions which allow expression of said 
polypeptide; and a polypeptide containing an HCV epitope produced by this method. 
40 The invention also includes a method for detecting HCV nucleic acids in a sample comprising reacting 
nucleic acids of the sample with a prot>e for an HCV polynucleotide under conditions which allow tiie 
formation of a polynucleotide duplex between the probe and the HCV nucleic acid from the sample; and 
*^ detecting a polynucleotide duplex which contains tiie probe. 

Immunoassays are also included in the invention. These include an immunoassay for detecting an HCV 
45 antigen comprising Incubating a sample suspected of containing an HCV antigen witii a profc>e antibody 
directed against the HCV antigen to be detected under conditions which allow the formation of an antigen- 
antibody complex; and detecting an antigen-antibody complex containing the probe antibody. An im- 
munoassay for detecting antibodies directed against an HCV antigen comprising incubating a sample 
suspected of containing anti-HCV antibodies with a probe polypeptide which contains an epitope of the 
50 HCV. under conditions which allow tiie formation of an antibody-antigen complex; and detecting tiie 
antibody-antigen complex containing the probe antigen. 

Also included in the invention are vaccines for treatment of HCV infection comprising an immunogenic 
peptide containing an HCV epitope, or an inactivated preparation of HCV, or an attenuated preparation of 
HCV. 

55 Another aspect of the invention is a tissue culture grown cell infected with HCV. 

Yet another aspect of the Invention Is a method for producing antibodies to HCV comprising administer- 
ing to an individual an isolated immunogenic polypeptide containing an HCV epitope in an amount sufficient 
to produce an immune response. 
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Still another aspect of the invention is a method for isolating cDNA derived from the genome of an 
unidentified infectious agent comprising: (a) providing host cells transformed with expression vectors 
containing a cDNA library prepared from nucleic adds isolated from tissue infected with the agent and 
growing said host cells under conditiohs which allow expression of polypeptide(s) encoded in the cDNA; (b) 

5 interacting the expression products of the cDNA with an antibody containing body component of an 
individual infected with said infectious agent under conditions which allow an immunoreactibn, and detecting 
antibody-antigen complexes formed as a result of the interacting; (c) growing host ceils which express 
polypeptides that form antibody-antigen complexes in step (b) under conditions which allow their growth as 
indhridual clones and isolating said clones; (d) growing cells from the clones of (c) under conditibhs which 

10 allow expression of polypeptide{is) encoded within the cDNA, and interacting the expression products with 
antibody containing body components of individuals other than the Individual in step (a) who are infected 
with the infectious agent and with control individuals uninfected with the agent, and detecting antibody- 
antigen complexes formed as a result of the interacting; (e) growing host ceils which express polypeptides 
that form ahtfbody-antigen complexes vvlth antibody containing body components of infected individuals and 

75 individuals suspected of being infected, and not with said components of control individuals, under 
conditions which allow their growth as individual clones and isolating said clones; and (f) isolating the cDNA 
from the host cell clones of (e). 



20 Brief Description of the Drawings 

Fig. 1 shows the double-stranded nucleotide sequence of the HCV cDNA insert in clone 5-1-1, and 
the putative amino acid sequence of the polypeptide encoded therein. 

Fig. 2 shows the homologies of the overlapping HCV cDNA sequences in clones 5-1-1, 81, 1-2. and 

25 91. 

Rg. 3 shows a composite sequence of HCV cDNA derived from overlapping clones 81, 1-2, and 91. 
and the anhino acid sequence encoded therein. 

Rg. 4 shows this double-stranded nucleotide sequence of the HCV cDNA insert in clone 81 . and the 
putative arhino acid sequence of the polypeptide encoded therein. 
30 * Rg. 5 shows the HCV cDNA sequence in clone 36, the segment which overlaps the NANBV cDNA of 
clone 81, and the polypeptide sequence encoded within clone 36. 

Rg. 6 shows the combined ORF of HCV cDNAs in clones 36 and 81. and the polypeptide encoded 
therein. 

Rg. 7 shows the HCV cDNA sequence In clone 32, the segment which overlaps clone 81 , and the 
35 polypeptide encoded therein. 

Fig. 8 shows the HCV cDNA sequence in clone 35, the segment which overlaps clone 36, and the 
polypeptide encoded therein. 

Fig. 9 shows the combined ORF of HCV cDNAs in clones 35. 36, 81, and 32. and the polypeptide 
encoded therein. 

40 Rg. 10 shows the HCV cDNA sequence in clone 37b, the segment which overlaps clone 35, and the 

polypeptide encoded therein. 

Rg. 11 shows the HCV cDNA sequence in clone 33b, the segment which overlaps clone 32, and the 
polypeptide encoded therein. 

Rg. 12 shows the HCV cDNA sequence in clone 40b, the segment which overlaps clone 37'b, and the 
4$ polypeptide encoded therein. 

Fig. 13 shows the HCV cDNA sequence in clone 25c. the segment which overlaps clone 33b, and the 
polypeptide encoded therein, 

Rg. 14 shows tiie nucleotide sequence and polypeptide encoded tiierein of the ORF which extends 
through the HCV cDNAs in clones 46b, 37b. 35, 36, 81, 32, 33b, and 25c. 
50 Rg, 15 shows the HCV cDNA sequence in clone ,33c, the segment which overlaps clones 40b and 

33c, and tiie amino acids encoded therein. 

Rg. 16 shows the HCV cDNA sequence in clone 8h, the segment which bverl^s clone 33c, and the 
amino acids encoded therein, 

Rg. 17 shows the HCV cDtMA sequence in clone 7e. the segment which overlaps clone 8h. and the 
55 amino acids encoded therein. 

Rg. 18 shows the HCV cDNA sequence in clone 14c, the segment which overlaps clone 25c, and the 
amino acids encoded therein. 
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Fig. 19 shows the HCV cDNA sequence in clone 8f, the segment which overlaps clone 14c, and the 
amino acids encoded therein. 

Rg. 20 shows the HCV cDNA sequence In clone 33f» the segment which overlaps clone 8f. and the 
amino acids encoded therein, 
5 Rg. 21 shows the HCV cDNA sequence in clone 33g» the segment which overlaps clone 33f. arid the 

amino acids encoded therein. 

Rg. 22 shows the HCV cDNA sequence in clone 7f, the segment which overlaps the sequence in 
clone 7e, and the arnino acids encoded therein. 

Rg. 23 shows the HCV cDNA sequence In clone 11b, the segment which overlaps the sequence in 
70 clone 7f , and the amino acids encoded therein. 

Rg. 24 shows the HCV cDNA sequence in clone 14i. the segment which overlaps the sequence in 
clone 11b, and the amino acids encoded therein, 

Rg. 25 shows the HCV cDNA sequence in cione 39c, the segment which overlaps the sequence in 
clone 33g. and the amino acids encoded therein. 
75 Rg. 26 shows a composite HCV cDNA sequence derived. from the aligned cDNAs In clones 141. lib. 

7f, 7e. 8h. 33c 40b 37b 35 36. 81, 32. 33b. 25c, 14c, 8f. 33f, 33g and 39c also shown is the amino acid 
sequence of the polypeptide encoded in the extended ORF in the derived sequence. 

Rg. 27 shows the sequence of the HCV cDNA in clone I2f. the segment which overlaps clone 14i, 
and the amino adds encoded therein. 
20 Rg. 28 shows the sequence of the HCV cDNA in clone 35f. the segment which overlaps clone 39c. 

and the amino acids encoded therein. 

Rg, 29 shows the sequence of the HCV cDNA in clone 19g, the segment which overlaps clone 35f. 
and the amino acids encoded therein. 

Rg. 30 shows the sequence of clone 26g, the segment which overlaps clon.e 19g, and the amino 
25 acids encoded therein. 

Rg. 31 shows the sequence of clone 15e, the segment which overlaps clone 26g, and the amino 
acids encoded therein. 

Rg. 32 shows the sequence in a composite cDNA, which was derived by aligning clones 12f through 
15e in the 5 to 3 direction; it also shows the amino acids encoded in the continuous ORF. 
30 Rg. 33 shows a photograph of Western blots of a fusion protein, SOD-NANB5-1-1. with chimpanzee 

semm from chimpanzees i nfe cted with BB-NANB. HAV, and HBV. 

Rg. 34 shows a photograph of Western blots of a fusion protein. SOD-NANBs-i-i, with serum from 
humans infected with NANBV, HAV. HBV. and from control humans. 

Rg. 35 is a map showing the significant features of the vector pAB24. 
35 Rg. 36 shows the putative amino acid sequence of the carboxy-terminus of the fusion polypeptide 

C1 00-3 and the nucleotide sequence encoding it. 

Rg. 37A Is a photograph of a coomassie blue stained polyacrylamide gel which identifies CI 00-3 
expressed in yeast 

Rg, 37B shows a Western blot of CI 00-3 with serum from a NANBV infected human. 
40 Rg. 38 shows an autoradiograph of a Northern blot of RNA isolated from the liver of a BB-NANBV 

infected chimpanzee, probed vwth BB-NANBV cDNA of clone 81. 

Rg. 39 shows an autoradiograph of NANBV nucleic add treated with RNase A or DNase I. and 
*^ probed with BB-NANBV cDNA of clone 81. 

Rg. 40 shows an autoradiograph of nucleic acids extracted from NANBV particles captured from 
45 infected plasma with anti-NANBs-1-1 , and probed with 32p.jabe!ed NANBV cDNA from clone 81. 

Rg, 41 shows autoradiographs of filters containing isolated NANBV nucleic acids, probed with ^^p. 
labeled plus and minus strand DNA probes derived from NANBV cDNA in clone 81. 

Rg, 42 shows the homologies between a polypeptide encoded in HCV cDNA and an NS protein from 
Dengue flavivirus. 

50 Rg, 43 shows a histograrh of the distribution of HCV infection in random samples, as determined by 

an EUSA screening. 

Rg. 44 shows a histogram of the distribution of HCV infection in random samples using two 
configurations of immunogibbulin-enzyme conjugate in an ELISA assay. 

Fig. 45 shows the sequences in a primer mix. derived from a conserved sequence in NS1 of 
55 ftavtviruses. 

Rg. 46 shows the HCV cDNA sequence in clone k9-1, the segments which overlaps the cDNA in Fig. 
26, and the amino adds encoded tiierein. 
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Fig. 47 shows the sequence in a composite cDNA which was derived by aligning clones l«9-1 through 
15e in the 5 to 3' direction; it also showa the amino acids encoded 'n the continuous ORF. 



5 Modes for Carrying Out the Invention 



1. Definitions 

10 The term "hepatitis C virus** has been reserved by workers In the field for an heretofore unknown 
etiologic agent of NANBH. Accordingly, as used herein, hepatitis C virus" (HCV) refers to an agent causitive 
of NANBH, which was formerly refenred to as NANBV and/or BB-NANBV. The terms HCV, NANBV, and BBr 
NANBV are used interchangeably herein. As an extension of this terminology, the disease caused by HCV, 
formerly called NANB hepatitis (NANBH), is called hepatitis C. The terms NANBH and hepatitis C may be 

75 used interchangeably herein. 

The term "HCV, as used herein, denotes a viral species which causes NANBH. and attenuated strains 
or defective Interfering particles derived therefrom. As shown infra., the HCV genome is comprised of RNA. 
It is known that RNA containing viruses have relatively high rates of spontaneous mutation, i.e.. reportedly 
on the order of 10~^ to 10~* per nucleotide (Relds & Knipe (1986)). Therefore, there are multiple strains 

20 within the HCV species described infra. The compositions and methods described herein, enable the 
propagation, identification, detection, and isolation of the various related strains. Moreover, they also allow 
the prepara^on of diagnostics and vaccines for the various strains, and have utility in screening procedures 
for anti-viral agents for pharmacologic use in that they inhibit replication of HCV. 

The information provided herein, although derived from one strain of HCV, hereinafter referred to as 

25 CDC/HCV1, is sufficient to allow a viral taxonomist to identify other strains which fall within the species. As 
described herein, we have discovered that HCV is a Flavivirus or Ravi-like virus. The morphology and 
composition of Ravivirus particles are known, and are discussed in Brinton (1986). Generally, with respect 
to morphology, Raviviruses contain a central nucleocapsid surrounded by a lipid bilayer. Virions are 
spherical and have a diameter of about 40-50 nm. Their cores are about 25-30 nm in diameter. Along the 

30 outer surface of the virion envelope are projections that are about 5-10 nm long witii terminal knobs about 2 
nm in diameter. 

HCV encodes an epitope which is immunologically identifiable with an epitope in the HCV genome from 
which the cDNAs described herein are derived; preferably the epitope is encoded in a cDNA described 
herein. The epitope is unique to HCV when compared to other known Raviviruses. The uniqueness of the 

35 epitope may be determined by its immunological reactivity with HCV and lack of immunological reactivity 
with other Ravivirus species. Methods for determining immunological reactivity are known in tiie art. for 
example, by radioimmunoassay, by Elisa assay, by hemagglutination, and several examples of suitable 
techniques for assays are provided herein. 

In addition to the above, tiie following parameters are applicable, either alone or in combination, in 

40 identifying a strain as HCV. Since HCV strains are evolutionarily related, it is expected that the overall 
homology of the genomes at the nucleotide level will be about 40% or greater, preferably about 60% or 
greater, and even more preferably about 80% or greater; and in addition that there will be corresponding 
contiguous sequences of at least about 13 nucleotides. The correspondence between the putative HCV 
strain genomic sequence and the CDC/CHI HCV cDNA sequence can be determined by techniques known 

45 in the art. For example, they can be determined by a direct comparison of the sequence information of the 
polynucleotide from the putative HCV, and tiie HCV cDNA sequence(s) described herein. For example, also, 
they can be determined by hybridization of the polynucleotides under conditions which form stable 
duplexes between homologous regions (for example, those which would be used prior to Si digestion), 
followed by digestion with single stranded specific nuclease(s), followed by size determination of the 

50 digested fragments. 

Because of tiie evolutionary relationship of tine strains of HCV, putative HCV strains are identifiable by 
their homology at the polypeptide level. Generally, HCV strains are more than about 40% honrlologous, 
preferably more than about 60% homologous, and even more preferably more than about 80% homologous 
at the polypeptide level. The techniques for determining amino acid sequence homology are known in the 
55 art For example, the amino acid sequence may be determined directiy and compared to the sequences 
provided herein. For example also, the nucleotide sequence of the genomic material of the putative HCV 
may be determined (usually via a cDNA intermediate); the amino add sequence encoded therein can be 
determined, and the corresponding regions compared. 
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As used herein, a polynucleotide "derived from" a designated sequence, for example, the HCV cDNA, 
particularly those exemplrfied in Rgs, 1-32. or from an HCV genome, refers to a polynucleotide sequence 
which is comprised of a sequence of approximately at least about 6 nucleotides, is preferably at least about 
8 nucleotides, is more preferably at least about 10-12 nucleotides, and even more preferably at least about 
5 15-20 nucleotides corresponding, i,e., homologous to or complementary to, a region of the designated 
nucleotide sequence. Preferably, the sequence of the region from which the polynucleotide is derived is 
homologous to or complementary to a sequence which is unique to an HCV genome. Whether or not a 
sequence is unique to the HCV genome can be determined by techniques known to those of skill in the art. 
For example, the sequence can be compared to sequences in databanks, e.g.. Genebank, to determine 
70 whether it is present in the uninfected host or other organisms. The sequence can also be compared to the 
'known sequences of other viral agents, including those which are known to induce hepatitis. e,g., HAV. 
HBV. and HDV, and to other members of the Ravlviridae. The correspondence or non-correspondence of 
the derived sequence to other sequences can also be determined by hybridization under the appropriate 
stringency conditions. Hybridization techniques for determining the complementarity of nucleic acid se- 
ts quences are known in the art. and are discussed infra. See also, for example, Maniatis et al. <1982). in 
addition, mismatches of duplex polynucleotides fomned by hybridization can be determined by known 
techniques, including for example, digestion with a nuclease such as S1 that specifically digests single- 
stranded areas in duplex polynucleotides. Regions from which typical DIMA sequences may be "derived" 
include but are not limited to. for example, regions encoding specific epitopes, as well as non-transcribed 
20 and/or non-translated regions. 

The derived polynucleotide is not necessarily physically derived from the nucleotide sequence shown, 
but may be generated in any manner. Including for example, chemical synthesis or DMA replication or 
reverse transcription or transcription, which are based on the infonmation provided by the sequence of 
bases in the region(s) from which the polynucleotide is derived. In addition, combinations of regions 
25 corresponding to that of the designated sequence may be modified in ways known in the art to be 
consistent with an intended use. 

Similariy. a polypeptide or amino acid sequence "derived from" a designated nucleic acid sequence, 
for example, the sequences in Rgs* 1-32, or from an HCV genome, refers to a polypeptide having an amino 
acid sequence identical to that of a polypeptide encoded in the sequence, or a portion thereof wherein the 
30 portion consists of at least 3-5 amino acids, and more preferably at least 8-10 amino acids, and even more 
preferably at least 11-15 amino adds, or which is immunologically identifiable with a polypeptide encoded 
in the sequence. 

A recombinant or derived polypeptide is not necessarily translated from a designated nucleic acid 
sequence, for exanple, the sequences in Rgs. 1-32. or from an HCV genome; it may be generated in any 
35 manner, including for example, chemical synthesis, or expression of a recombinant expression system, or 
isolation from mutated HCV. 

The term "recombinant polynucleotide" as used hierein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is not associated with all 
or a portion of the polynucleotide with which it is associated in nature or in the form of a library; and/or (2) 
40 is finked to a polynucleotide other than that to which it is linked in nature. 

The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. 
^Thus. this term includes double- and single-stranded DNA, as well as double- and single stranded RI^IA ft 
also includes modified, for example, by methylation and/or by capping, and unmodified forms of the 
45 polynucleotide. 

As used herein, the tenm "HCV containing a sequence corresponding to a cDNA" means that the HCV 
contains a polynucleotide sequence which is homologous to or complementary to a sequence in the 
designated DMA; the degree of homology or complementarity to the cDNA will be approximately 50% or 
greater, will preferably be at least about 70%. and even more preferably will be at least alx)ut 90%. The 

50 sequences which correspond will be at least about 70 nucleotides, preferably at least about 80 nucleotides, 
and even more preferably at least at>out 90 nucleotides in length. The correspondence between the HCV 
sequence and the cDNA can be determined by techniques known in the art, including, for example, a direct 
comparison of the sequenced material with the cDNAs described, or hybridization and digestion with single 
strand nucleases, followed by size determination of the digested fragments. 

55 The term "purified of viral polynucleotide" refers to an HCV genome or fragment thereof which is 
essentially free. i.e., contains less than about 50%, preferably less than about 70%, and even more 
preferably less than about 90% of polypeptides with which the viral polynucleotide is naturally associated. 
Techniques for purifying viral polynucleotides from viral particles are known in the art, and include for 
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example, disruption of the particle wtth a chaotropic agent, and separation of the polynucleotide(s) and 
polypeptides by ion-exchange chromatography, affinity chromatography, and sedimentation according to 
density. 

The ierm "purified viral polypeptided" refers to an HCV polypeptide or fragment thereof which is 

5 essentially free, i.e.. contains less than about 50%, preferably less than about 70%. and even more 
preferably less than about 90%. of cellular components with which the viral polypeptide Is naturally 
associated. Techniques for purifying viral polypeptides are known in the art and examples of these 
techniques are discussed infra. 

"Recombinant host ceils", "host cells", "cells", "cell lines", "cell cultures:, and other such tenms 

70 denoting microorganisms or higher eukaryotic cell lines cultured as unicellular entities refer to celis which 
can be, or have been, used as recipients for recombinant vector or other transfer DNA. and include the 
progeny of the original cell which has been transfected. It is understood that the progeny of a single 
parental cell may not necessarily be completely identical In morphology or in genomic or total DNA 
complement as the original parent, due to accidental or deliberate mutation. Progeny of the parental cell 

75 which are sufficientiy similar to the parent to be characterized by tiie relevant property, such as the 
presence of a nucleotide sequence encoding a desired peptide, are included in the progeny intended by 
tills definition, and are covered by the above terms. 

A "replicon" is any genetic element, e.g.. a plasmid, a chromosome, a viais, that behaves as an 
autonomous unit of polynucleotide replication within a cell; i.e., capable of replication under its own control. 

20 A "vector" is a replicon in which another polynucleotide segment is attached, so as to bring about the 
replication and/or expression of the attached segment 

"Control sequence" refers to polynucleotide sequences which are necessary to effect the expression of 
coding sequences to which they are iigated. The nature of such control sequences differs depending upon 
the host organism; in prokaryotes, such control sequences generally include promoter, ribosomal binding 

25 site, and terminators; in eukary'otes, generally, such control sequences include promoters, terminators and, 
in some instances, enhancers. The temn "control' sequences" is intended to include, at a minimum, all 
components whose presence is necessary for expression, and may also include additional components 
whose presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a juxtaposition wherein the components so described are in a relationship 

30 permitting them to function in their intended manner. A control sequence "operaWy linked" to a coding 
sequence is Iigated in such a way that expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 

An "open reading frame" (ORF) is a region of a polynucleotide sequence which encodes a polypeptide; 
tills region may represent a portion of a coding sequence or a total coding sequence. 

35 A "coding sequence" is a polynucleotide sequence which is transcril:>ed into mRNA and/or t-anstated 
into a polypeptide when placed under ttie control of appropriate regulatory sequences. The boundaries of 
tiie coding sequence are determined by a translation start codon at the 5 -terminus and a translation stop 
codon at the s'-terminus. A coding sequence can include, but is not limited to mRNA. cDNA. and 
recombinant polynucleotide sequences. 

40 "Immunologically identifiable with/as" refers to the presence of epitope(s) and polypeptides(s) which are 
also present in and are unique to the designated polypeptide(s), usually HCV proteins. Immunological 
identity may be determined by antibody binding and/or competition in binding; these techniques are known 
to tiiose of average skill in the art. and are also illustrated infra. The uniqueness of an epitope can also be 
determined by computer searches of known data banks, e.g. Qenebank, for the polynucleotide sequences 

45 which ^code tiie epitope, and by amino acid sequence comparisons with other known proteins. 

As used herein, "epitope" refers to an antigenic detemninant of a polypeptide; an epitope could 
comprise 3 amino acids in a spatial conformation which is unique to the epitope, generally an epitope 
consists of at least 5 such amino acids, and more usually, consists of at least 8-10 such amino acids. 
Methods of determining the spatial conformation of amino acids are known in the art and include, for 

50 example, x-ray crystallography and 2-dimensional nuclear magnetic resonance. 

A polypeptide is "immunologically reactive" with an antibody when it binds to an antibody due to 
antit)ody recognition of a specific epitope contained within the polypeptide. Immunological reactivity may be 
determined by antibody binding, more particularly by the kinetics of antibody binding, and/or by competition 
in binding using as competitor(s} a known polypeptide(s) containing an epitope against which tiie antibody 

55 is directed. The techniques for determining whether a polypeptide is immunologically reactive with an 
antibody are known in the art. 

As used herein, tiie tenn "immunogenic polypeptide containing an HCV epitope" includes naturally 
occuning HCV polypeptides or fragments thereof, as well as polypeptides prepared by other means, for 
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example, chemical synthesis, or the expression of the polypeptjde in a recombinant organism. 

The term "polypeptide" refers to a rnolecular chain of amino acids and does not refer to a specific 
length of the product; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide. This term also does not refer to post-expression modifications of the polypeptide, for example, 
5 giycosylations, acetylations. phosphorylations and the like. 

"Transformation", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, 
irrespective of the method used for the insertion, for example, direct uptake, transduction, or f-mating. The 
exogenous polynucleotide may be maintained as a non-integrated vector, for example, a plasmid, or 
alternatively, may be integrated into the host genome. 
10 "Treatment" as used herein refers to prophylaxis and/or therapy. 

An "individual", as used herein, refers to vertebrates, particularly members of the mammalian species, 
and includes but is not limited to domestic animals, sports animals, primates, and humans. 

As used herein, the "plus strand" of a nucleic acid contains the sequence that encodes the polypeptide. 
The "minus strand" contsuns a sequence which is complementary to that of the "plus strand". 
75 As used herein, a "positive stranded genome" of a virus is one in which the genome, whetiier RNA or 
DNA, is singie-stranded and which encodes a viral polypeptide(s). Examples of positive stranded RNA 
viruses include Togaviridae, Coronaviridae, Retroviridae. Picomaviridae, and Caliciviridae. Included also, are 
the Raviviridae, which were formerly classified as Togaviradae. See Fields & Knipe (1986). 

As used herein, "antibody confining body component" refers to a component of an individual's body 
20 which is a source of the antibodies of interest. Antibody containing body components are known in the art, 
and include but are not limited to, for example, plasma, serum, spinal fluid, lymph fluid, the external 
sections of the respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, white blood cells, and 
myelomas. 

As used herein, "purified HCV" refers to a preparation of HCV which has been Isolated from the cellular 
25 constituents' with which the virus is normally associated, and from other types of viruses which may be 
present in the infected tissue. The techniques for isolating viruses are known to those of skill in tiie art, and 
include, for example, centrifugatioh and affinity chromatography; a method of preparing purified HCV is 
discussed infra 

30 ' 

II. Description of the Invention 

The practice of the present invention will employ, unless otherwise indicated, conventional techniques 
of molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. 

35 Such techniques are explained fully in the literature. See e.g., Manlatis, Fitsch & Sambrook, MOLECULAR 
CLONING; A LABORATORY MANUAL (1982); DNA CLONING. VOLUMES I AND II (D.N Glover ed. 1985); 
OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); NUCLEIC ACID HYBRIDIZATION (B.D. Hames & 
S.J. Higgins eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins eds. 1984); 
ANIMAL CELL CULTURE (R.I. Freshney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL Press, 

40 1986); B. Pert)al. A PRACTICAL GUIDE TO MOLECULAR CLONING (1984); the series, METHODS IN 
ENZYMOLOGY (Academic Press, Inc.); GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. 
Miller and M.P. Calos eds. 1987, Cold Spring Harbor Laboratory), Methods in Enzymology Vol. 154 and VoL 
155 (Wu and Grossman, and Wu, eds,, respectively), Mayer and Walker, eds. (1987), IMMUNOCHEMICAL 
METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press. London), Scopes, (1987), PROTEIN 

45 PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition (Springer-Veriag. N,Y.). and HANDBOOK 
OF EXPERIMENTAL IMMUNOLOGY, VOLUMES !-fV (D.M. Weir and C. C. Blackwell eds 1986). 

AH patents, patent applications, and publications mentioned herein, both supra and infra, are hereby 
incorporated herein by reference. 

The useful materials and processes of the present invention are made possible by the provision of a 

50 family of closely homologous nucleotide sequences isolated from a cDNA library derived from nucleic acid 
sequences present in the plasma of an HCV infected chimpanzee. This family of nucleotide sequences is 
not of human or chimpanzee origin, since it hybridizes to neither human nor chimpanzee genomic DNA 
from uninfected individuals, since nucleotides of Ms family of sequences are present oniy in liver and 
plasma of chimpanzees with HOy infection, and since the sequence is not present in Genebank. In addition, 

55 the family of sequences shows no significant homology to sequences contained within the HBV genome. 

The sequence of one member of the family, contained within clone 5-1-1, has one continuous open 
reading frame (ORF) which encodes a polypeptide of approximately 50 amino acids. Sera from HCV 
infected humans contain antibodies which bind to this polypeptide, whereas sera from non-infected humans 
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do not contain antibodies to this polypeptide. Rnally» whereas the sera from uninfected chimpanzees do not 
contain antibodies to this polypeptide, the antibodies are induced in chimpanzees following acute NANBH 
infection. Moreover, antibodies to this polypeptide are not detected in chimps and humans infected with 
HAV and HBV. By these ciiteria the sequence is a cDNA to a viral sequence, wherein the virus causes or is 
5 associated with NANBH; this cDNA sequence is shown in Rg. 1. As discussed infra, the cDNA sequence in 
clone 5-1-1 differs from that of the other Isolated cDNAs In that it contains 28 extra base pairs. 

A composite of other identified members of the cDNA family, which were isolated using as a probe a 
synthetic sequence equivalent to a fragment of the cDNA in clone 5-1-1. is shown in Rg. 3. A member of 
the cDNA family which was isolated using a synthetic sequence derived from the cDNA in clone 81 is 

10 shown in Rg. 5, and the composite of this sequence with that of clone 81 is shown in Rg. 6 Other 
members of the cDNA family, including those present in clones 12f, 141, 11b. 7f. 7e, 8h. 33c. 46b. 37b, 35, 
36, 81. 32, 33b, 25c, 14c. 8f, 33f. 33g. 39c. 35f. 19g, 26g and 15e are described in Section IV.A. A 
composite of the cDNAs in these clones is described in Section IV. A 19. and shown in Rg. 32. The 
composite cDNA shows that it contains one continuous ORF, and thus encodes a poiyprotein. This data is 

15 consistent with the suggestion, discussed infra., that HCV is a Ravivirus or fiavl-iike virus. 

The availability of this family of cDNAs shown in Rgs. 1-32. inclusive, permits the construction of DNA 
probes and polypeptides useful in diagnosing NANBH due to HCV infection and in screening blood donors 
as well as donated blood and blood products for infection. For example, from the sequences it is possible to 
synthesize DNA oligomers of about 8-10 nucleotides, or larger, which are useful as hybridization probes to 

20 detect the presence of the viral genome in. for example, sera of subjects suspected of harboring the virus, 
or for screening donated blood for the presence of the virus. The family of cDNA sequences also allows the 
design and production of HCV specific polypeptides which are useful as diagnostic reagents for the 
presence of antibodies raised during NANBH. Antibodies to purified polypeptides derived from the cDNAs 
may also be used to detect viral antigens in infected Individuals and in blood. 

25 Knowledge of tiiese cDNA sequences also enable the design and production of polypeptides which 
may be used as vaccines against HCV and also for the production of antibodies, which in turn may be used 
for protection against tiie disease, and/or for therapy of HCV infected individuals. 

Moreover, the family of cDNA sequences enables furtfier characterization of the HCV genome. 
Polynucleotide probes derived from ttiese sequences may be used to screen cDNA libraries for additional 

30 overiapping cDNA sequences, which, in turn, may be used to obtain more overiapping sequences. Unless 
the genome is segmented and the segments lack common sequences, this technique may be used to gain 
the sequence of the entire genome. However, if the genome is segmented, other segments of the genome 
can be obtained by repeating the lambda-gtll serological screening procedure used to isolate tiie cDNA 
clones described herein, or alternatively by isolating the genome from purified HCV particles. 

3^ The family of cDNA sequences and the polypeptides derived from these sequences, as well as 
antibodies directed against these polypeptides are also useful in the isolation and identification of the BB- 
NANBV agent(s). For example, antibodies directed against HCV epitopes contained in polypeptides derived 
from the cDNAs may be used in processes based upon affinity chromatography to isolate the virus. 
Aitematively. tiie antibodies may be used to identify viral particles isolated by other techniques. The viral 

40 antigens arKl the genomic material within the isolated viral particles may tiien be further characterized. 

The infomiation obtained from further sequencing of the HCV genome(s), as well as from further 
characterization of tiie HCV antigens and characterization of the genome enables the design and syntilesis 
of additional probes and polypeptides and antibodies which may be used for diagnosis, for prevention, and 
for therapy of HCV induced NANBH, and for screening for infected blood and blood-related products. 

45 The availability of probes for HCV. including antigens and antibodies, and polynucleotides derived from 
tiie genome from which tiie family of cDNAs is derived also allows for the development of tissue culture 
systems which will be of major use in elucidating tiie biology of HCV. This in turn, may lead to the 
development of new treatment regimens based upon antiviral compounds which preferentially Inhibit the 
replication of, or infection by HCV. 

50 The method used to identify and isolate tiie etioiogic agent for NANBH is novel, and it may be 
applicable to the identification and/or isolation of heretofore uncharacterized agents which contain a 
genome, and which are associated witii a variety of diseases, including those induced by viruses, viroids, 
bacteria, fungi and parasites. In ttiis metiiod, a cDNA library was created from tiie nucleic acids present in 
infected tissue from an infected individual. The library was created in a vector which allowed tiie expression 

55 of polypeptides encoded in the cDNA. Clones of host cells containing the vector, which expressed an 
immunologically reactive fragment of a polypeptide of tiie etioiogic agent were selected by immunological 
screening of tiie expression products of the library witii an antibody containing body comporient from 
anottier individual previously infected with the putative agent. The steps in the immunological screening 
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technique included interacting the expression products of the cDNA containing vectors with the antibody 
containing body component of a second infected individual, and detecting the formation of antibody-antigen 
complexes between the expression product(s) and antibodies of the second infected individual. The isolated 
clones are screened further immunologically by interacting their expression products with the antibody 
contaning body components of other individuals infected with the putative agent and with control individuals 
uninfected with the putative agent, and detecting the fonnnation of antigen-antibody complexes with 
antibodies from the infected individuals; and the cDNA containing vectors which encode polypeptides which 
react immunologically with antibodies from infected individuals and individuals suspected of being infected 
with the agent but not with control individuals are isoiated. the infected individuals used for the 
construction of tiie cDNA library, and for the immunological screening need not be of the same species. 

The cDNAs isolated as a result of tiiis method, and tiieir expression products, and antibodies directed 
against the expression products, are useful in characterizing and/or capturing tiie etiologic agent. As 
described in more detail infra, this method has been used successfully to Isolate a family of cDNAs derived 
from the HCV genome. _ 



II A Preparation of the cDNA Sequence 

Pooled semm from a chimpanzee with chronic HCV infection and containing a high titer of the virus, 
i.e.. at least 10^ chimp infectious doses/ml (CID/ml) was used to isolate viral particles; nucleic acids isolated 
from these particles was used as tiie template in tiie construction of a cDNA library to the viral genome. 
The procedures for isolation of putative HCV particles and for consti-ucting tiie cDNA library In iambda-gt11 
is discussed in Section IVA1. Lambda-gt11 is a vector that has been developed specifically to express 
inserted cDNAs as fusion polypeptides witii beta-galactosidase and to screen large numbers of recombinant 
phage with specific antisera raised against a defined antigen. The Iambda-gt11 cDNA library generated from 
a cDNA pool containing cDNA of approximate mean size of 200 base pairs was screened for encoded 
epitopes tiiat could bind specifically witii sera derived from patients who had previously experienced NANB 
hepatitis. Huynh. T.V. et al. (1985). Approximately 10^ phages were screened, and five positive phages 
were identified, purified, and then tested for specificity of binding to sera from different humans and 
chimpanzees previously infected witii tiie HCV agent One of tine phages. 5-1-1. bound 5 of the 8 human 
sera tested. This binding appeared selective for sera derived from patients with prior NANB hepatitis 
infections since 7 normal blood donor sera did not exhibit such binding. 

The sequence of the cDNA in recombinant phage 5-1-1 was determined, and is shown in Rg. 1. The 
polypeptide encoded by tiiis cloned cDNA. which is in the same translational frame as the N-terminal beta- 
Galactosidase moiety of tiie fusion polypeptide is shown above the nucleotide sequence. This translational 
GRP. therefore, encodes an epitope(s) specifically recognized by sera from patients with NANB hepatitis 
infections. 

The availability of the cDNA in recombinant phage 5-1-1 has allowed for ttie isolation of other clones 
containing additional segments and/or alternative segments of cDNA to the viral genome. The lambda-gtll 
cDNA library described supra, was screened using a synthetic polynucleotide derived from tiie sequence of 
tiie cloned 5-1-1 cDNA. This screening yielded three otiier clones, which were identified as 81. 1-2 and 91; 
the cDNAs contained wittiin tfiese clones were sequenced. See Sections IV A3, and rVA4. The homologies 
between the four independent clones are shown in Rg. 2, where the homologies are indicated by tiie 
vertical lines. Sequences of nucleotides present uniquely in clones 5-1-1. 81. and 91 are indicated by small 
letters. 

The cloned cDNAs present in recombinant phages in clones 5-1-1, 81, 1-2. and 91 are highly 
homologous, and differ in only two regions. Rrst nucleotide number 67 in clone 1-2 is a thymidine, whereas 
tiie other three clones contain a cytidine residue in tills position. This substitution, however, does not alter 
the nature of the encoded arnino acid. , 

The second difference between tiie clones is that clone 5-1-1 contains 28 base pairs at its 5 -terminus 
which are not present in the otiier clones. The extra sequence may be a 5'-terminal cloning artifact; 5 - 
tenninal cloning artifacts are commonly observed in tiie products of cDNA metiiods, 

Syntiietic sequences derived from tiie 5'-region and tiie 3'-region of the HCV cDNA in clone 81 were 
used to screen and isolate cDNAs from tiie lambda-gtll NANBV cDNA library, which overiapped clone 81 
cDNA (Section IV A5.). The sequences of tiie resulting cDNAs, which are in clone 36 and clone 32, 
respectively, are shown in Rg. 5 and Rg. 7. ^ 

Similariy. a synthetic polynucleotide based on tiie 5 -region of clone 36 was used to screen and isolate 
cDNAs from the lambda gt-11 NANBV cDNA Tibrary which overlapped clone 36 cDNA (Section IV.A.8.). A 
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purifiecl cione of recombinant phage-containing cDNA which hybridized to the synthetic polynucleotide 
probe was named clone 35 and the NANBV cDNA sequence contained within this clone is shown in Rg. 8. - 
By utilizing the technique of isolating overlapping cDNA sequences;, clones containing additional 
upstream and downstream HCV cDNA sequences have been obtained. The isolation of these clones, is 
5 described infra in Section IV.A^ 

Analysis of the nucleotide sequences of the HCV cDNAs encoded within the isolated clones show that 
the composite cDNA contains one long continuous ORF. Fig. 26 shows the sequence of the composite 
cDNA from these clones, along with the putative HCV polypeptide encoded therein. 

The description of the method to retrieve the cDNA sequences is mostly of historical interest The 
10 resultant sequences (and their complements) are provided herein, and the sequences* or any portion 
thereof, could be prepared using synthetic methods, or by a combination of synthetic methods with retrieval 
of partial sequences using methods similar to those described herein. 

Lambda-gt11 strains replicated from the HCV cDNA library and from clones 5-1-1, 81, 1-2 and 91 have 
been deposited under the terrris of the Budapest Treaty with the American type Culture Collection (ATCC). 
76 12301 Parklawn Dr.. Rockville, Maryland 20852, and have been assigned the following Accession Numbers, 



Iambda-gt11 


ATCC 
No. 


Deposit Daie 


HCV cDNA library 


40394 


1 Dec. 1987 


clone 81 


40388 


17 Nov. 1987 


clone 91 


40389 


17 Nov. 1987 


clone 1-2 


40390 


17 Nov. 1987 


clone 5-1-1 


40391 


18 Nov. 1987 



The designated deposits will be maintained for a period of thirty (30) years from the date of deposit, or 
for five (5) years after the last request for the deposit; or for the enforceable life of the U.S. patent 
whichever is longer. These deposits and other deposited materials mentioned herein are intended for 
convenience only, and are not required to practice the present invention in view of the description here. The 
HCV cDNA sequences in all of the deposited materials are incorporated herein by reference. 

The description above, of "walking" tiie genome by isolating overlapping cDNA sequences from the 
HCV lambda gt-11 library provides one metinod by which cDNAs corresponding to tiie entire HCV genome 
maiy be isolated. However, given the information provided herein, other methods for isolating these cDNAs 
are obvious to one of skill in the art. Some of these methods are described in Section IV.A., infra. 



II.B. Preparation of Viral Polypeptides and Fragments 

The availability of cDNA sequences, either those isolated by utilizing the cDNA sequences in Rgs. 1-32, 
as discussed infra, as well as the cDNA sequences in these figures, permits the construction of expression 
vectors encoding antigenically active regions of the polypeptide encoded in either strand. These antigeni- 
cally active regions may be derived from coat or envelope antigens or from core antigens, including, for 
example, polynucleotide binding proteins, polynucleotide polymerase(s), and otfier viral proteins required for 
the replication and/or assembly of the virus particle. Fragments encoding tiie desired polypeptides are 
derived from the cDNA clones using conventional restriction digestion or by synthetic methods, and are 
ligated into vectors which may, for example, contain portions of fusion sequences such as beta-Galac- 
tosidase or superoxide dismutase (SOD), preferably SOD. Methods and vectors which are useful for the 
production of polypeptides which contain fusion sequences of SOD are described in European Patent Office 
Publication number 0196056, published October 1. 1986. Vectors encoding fusion polypeptides of SOD and 
HCV polypeptides, i.o.. NANBs-i-i, NANBgi. and 01 00-3, which is encoded in a composite of HCV 
cDNAs, are described In Sections IV.B.1. IV.B.2. and IV.B.4. respectively. Any desired portion of tiie HCV 
cDNA containing an open reading frame, in either sense strand, can be obtained as a recombinant 
polypeptide, such as a mature or fusion protein; alternatively, a polypeptide encoded in tine cDNA can be 
provided by chemical synthesis. 

The DNA encoding the desired polypeptide, whetiier In fused or mature form, and whetiier or not 
containing a signal sequence to penmit secretion, may be ligated into expression vectors suitable for any 
convenient host Both eukaryotic and prokaryotic host systems are presently used in forming recorhbinant 
polypeptides^ and a summary of some of tiie more common conti'oi systems and host cell lines is given in 
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Section III A, infra. The polypeptide is then isolated from lysed cells or from the culture medium and 
purified to the extent needed for its intended use. Purification may be by techniques known in the art, for 
example, salt fractionation, chrbmiatography oh ion exchange resins, affinity chrorhatography. centrifugation. 
and the like. See, for example. Methods in Enzymology for a variety of methods for purifying proteins. Such 

5 polypeptides can be used as diagnostics, or those which give rise to neutralizing antibodies may be 
formulated into vaccines. Antibodies raised against these polypeptides can also be used as diagnostics, or 
for passive immunotherapy. In addition, as discussed in Section IIJ* herein below, antibodies to these 
polypeptides are useful for isolating and identifying! HCV particles. 

The HCV antigens may also be isolated from HCV virions. The virions may be grown in HCV infected 

w cells in tissue culture, or in an infected host 



ILC. Preparation of Antigenic Polypeptides and Conjugation with Carrier 

75 An antigenic region of a polypeptide is generally relatively small-typically 8 to 10 amino acids or less 
in length. Fragments of as few as 5 amino acids may characterize an antigenic region. These segments 
may correspond to regions of HCV antigen. Accordingly, using tine cDNAs of HCV as a basis, DNAS 
encoding short segments of HCV polypeptides can be expressed recombinantiy either as fusion proteins, or 
as isolated polypeptides. In addition, short amino acid sequences can be conveniently obtained by chemical 

20 synthesis. In instances wherein \he synthesized polypeptide is correctly configured so as to provide tiie 
correct epitope, but is too small to be immunogenic, the polypeptide may be linked to a suitable carrier. 

A number of techniques for obtaining such linkage are known in the art, including the formation of 
disulfide linkages using N-sucdnimidyl-3-(2-pyridylthio)proplonate (SPDP) and succinimidyl 4-(N-mal- 
eimidomethyl)cyclohexane-1-carboxylate (SMCC) obtained from Pierce Company. Rockford, Illinois, (if tfie 

25 peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue.) These reagents 
create a disulfide linkage between themselves and peptide cysteine residues on one protein and an amide 
linkage through the epsilon-amino on a lysine, or other free amino group in the other. A variety of such 
disulfide/amide-forming agents are known. See, for example, Immun. Rev. (1982) 62:185. Other bifunctional 
coupiing agents form a thioether rather than a disulfide linkage. Many of these thio'^ther-fonming agents are 

30 commercially available and include reactive esters of 6-maleimidocaproic acid, 2-bromoacetic acid, 2- 
iodoacetic acid, 4-(N-maIeimidometiiyl)cyclohexane-1-carboxyIic add, and tiie like. The carboxyl groups can 
be activated by combining tiiem with succinimide or 1-hydroxyl-2-nltro-4-sulfonic acid, sodium salt. The 
foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly be 
used. 

35 Any canier may be used which does not itself induce the production of antibodies harmful to the host. 
Suitable carriers are typically large, slowly metabolized macromolecules such as proteins; polysaccharides, 
such as latex functionalized sepharose, agarose, cellulose, cellulose beads and the like; polymeric amino 
acids, such as polyglutamic acid, polylyslne, and the like; amino acid copolymers; and inactive virus 
particles, see, for example, section 11.D. Especially useful protein substrates are serum albumins, keyhole 

40 limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and otiier proteins 
well lanown to those skilled in the art 

{LP. Preparation of Hybrid Particle Immunogens Containing HCV Epitopes 

45 

The immunogenicity of the epitopes of HCV may also be enhanced by preparing them in mammalian or 
yeast systems fused with or assembled with particle-fonnning proteins such as, for example, that associated 
with hepatitis B surface antigen. Constructs wherein the NANBV epitope is linked directiy to the particle- 
forming protein coding sequences produce hybrids which are immunogenic with respect to tine HCV 

50 epitope. In addition, all of the vectors prepared include epitopes specific to HBV, having various degrees of 
immunogenicity, such as. for example, the pre-S peptide. Thus, particles constructed from particle forming 
protein which include HCV sequences are immunogenic with respect to HCV and HBV, 

Hepatitis surface antigen (HBSAg) has been shown to be formed and assembled into particles in S. 
cerevisiae (Valenzuela et al. (1982)). as well as in. for example, mammalian cells (Valenzuela, P.. et aL 

55 (1984)). The formation of such particles has been shown to enhance the immunogenicity of the monomer 
subunit The constmcts may also include the immunodominant epitope of HBSAg, comprising tiie 55 amino 
acids of the presurface (pre-S) region. Neurath et al. (1984). Constructs of the pre-S-HBSAg particle 
expressible in yeast are disclosed in EPO 174,444, published March 19, 1986; hybrids including heterolo- 
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gous viral sequences for yeast expression are disclosed in EPO 175,261 » published March 26. 1966, Both 
applications are assigned to the herein assignee, and are incorporated herein by reference. These 
constructs may also be expressed in mammalian ceils such as Chinese hamster ovary (CHO) cells using an 
SV40-dihydrofolate reductase vector (Michelle et ai. (1984)). 
5 In addition, portions of the particle-fonming protein coding sequence may be replaced with codons 
encoding an HCV epitope. In this replacement, regions which are not required to mediate the aggregation of 
the units to form immunogenic particles in yeast or mammals can be deleted, thus eliminating additional 
HBV antigenic sites from competition with the HCV epitope. 

10 

ll.E. Preparation of Vaccines 

Vaccines may be prepared from one or more immunogenic polypeptides derived from HCV cDNA as 
well as from the cDNA sequences in the Rgs. 1-32. or from tiie HCV genome to which tiiey correspond. 

75 The observed homology between HCV and Flaviviruses provides information concerning the polypeptides 
which are likely to be most effective as vaccines, as well as the regions of tiie genome in which they are 
encoded. The general structure of the Ravivirus genome is discussed in Rice et al (1986). The flavivirus 
genomic RNA is believed to be the only vims-specific mRNA species, and it is translated into tiie three viral 
structural proteins, i.e., C, M, and E. as well as two large nonstructural proteins, NV4 and NV5. and a 

20 complex set of smaller nonstructural proteins. It is known that major neutralizing epitopes for Raviviruses 
reside in tiie E (envelope) protein (Roehrig (1986)). The corresponding HCV E gene and polypeptide 
encoding region can be predicted, based upon the homology to Raviviruses. Thus, vaccines may be 
comprised of recombinant polypeptides containing epitopes of HCV E. These polypeptides may be 
expressed in bacteria, yeast, or mammalian cells, or alternatively may be isolated from viral preparations. It 

25 is also anticipated that the other structural proteins may also contain" epitopes which give rise to protective 
anti-HCV antibodies. Thus, polypeptides containing the epitopes of E, C, and M may also be used, whetiier 
singly or in combination, in HCV vaccines. 

In addition to the above, it has been shown that immunization with NS1 (nonstructural protein 1), results 
in protection against yellow fever (Schlesinger et al (1986)). This is true even though tiie immunization does 

30 riot give rise to neutralizing antibodies. Thus, particularly since this protein appears to be highly conserved 
among Raviviruses, it is likely that HCV NSl will also be protective against HCV infection. Moreover, it also 
shows that nonstructural proteins may provide protection against viral pathogenicity, even if they do not 
cause the production of neutralizing antibodies. 

In view of the above, multivalent vaccines against HCV may be comprised of one or more structural 

35 proteins, and/or one or more nonstructural proteins. These vaccines may be comprised of, for example, 
recombinant HCV polypeptides and/or polypeptides isolated from the virions. In addition, it may be possible 
to use inactivated HCV in vaccines; inactivation may be by the preparation of viral lysates, or by other 
means known in the art to cause inactivation of Flaviviruses, for example, treatment with organic solvents or 
detergents, or treatment with formalin. Moreover, vaccines may also be prepared from attenuated HCV 

40 strains. The preparation of attenuated HCV strains is described infra. 

It is known ti^at some of the proteins in Raviviruses contain highly conserved regions, thus, some 
immunological cross-reactivity is expected between HCV and other Raviviruses. It is possible that shared 
epitopes between tiie Flaviviruses and HCV will give rise to protective antibodies against one or more of the 
disorders caused by these pathogenic agents. Thus, it may be possible to design multipurpose vaccines 

45 based upon this knowledge. 

The preparation of vaccines which contain an immunogenic polypeptide(s) as active ingredients, is 
known to one skilled In tiie art. Typically, such vaccines are prepared as injectables. either as liquid 
solutions or suspensions; solid forms suitable for solution In, or suspension in. liquid prior to injection may 
also be prepared. The preparation may also be emulsified, or the protein encapsulated in liposomes. The 

50 active immunogenic ingredients are often mixed with exciplents which are pharmaceutically acceptable and 
compatible with the active ingredient Suitable exciplents are, for example, water, saline, dextrose, glycerol, 
ethanol. or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts 
of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which 
enhance the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are not 

55 limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor- 
muramyl-L-alanyl-D-isoglutamine (CGP 11637. refen^ed to as nor-MDP). N-acetylmuramyl-L-alanyl-D- 
isoglutaminyl-L-alanine-2-(1 '-2'-dipalmitoy!-sn-glycero-3-hydroxyphosphoryloxy)-ethyIamine (CGP 1 9835A. 
refenred to as MTP-PE). and RIBI, which contains three components extracted from bacteria, mon- 
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ophosphoryl lipid A. trehalose dimycolate and cell wall skeleton (MPL + TDM + CWS) in a 2% 
squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by measuring the 
amount of antibodies directed against an immunogenic polypeptide containing an HCV antigenic sequence 
resulting from administration of this polypeptide in vaccines which are also comprised of tine various 

5 adjuvants. — 

The vaccines are conventionally administered parenterally, by injection, for example, either sub- 
cutaheously or intramuscularly. Additional formulations which are suitable for other modes of administration 
include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and 
carriers may include, for example, polyalkylene glycols or triglycerides; suph suppositories may be formed 

70 from mixtures containing the active ingredient in the range of 0.5% to 10%, preferably 1%-2%. Oral 
formulations include such normally employed excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the 
like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release 
fonnr^ulations or powders and contain 10%-95% of active ingredient preferably 25%-70%. 

75 The proteins may be formulated into the vaccine as neutral or salt forms. Pharmaceutically acceptable 
salts include the acid addition salts (formed with free amino groups of the peptide) arKi which are fomned 
with inorganic acids such as. for example, hydrochloric or phosphoric acids, or such organic acids such as 
acetic, oxalic, tartaric, maleic. and tiie like. Salts formed with tiie free carboxyl groups may also be derived 
from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, 

20 and such organic bases as isopropylamine, trimethylamine, 2-ethyiamino ethanol, histidine, proc^ne, and 
the like. 



ILF. Dosage and Administration of Vaccines 

25 

The vaccines are administered in a manner compatible with tiie dosage formulation, and in such 
amount as will be prophylactically and/or therapeutically effective. The quantity to be administered, which Is 
generally in tiie range of 5 micrograms to 250 micrograms of antigen per dose, depends on the subject to 
be treated, capacity of the subject's immune system to synthesize antibodies, and the degree of protection 
30 desired. Precise amourtts of active ingredient required to be administered may depend on the judgment of 
the practitioner and may be peculiar to each subject 

The vaccine may be given in a single dose schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course of vaccination may be with 1-10 separate doses, 
followed by otiier doses given at subsequent time intervals required to maintain and or reenforce the 
35 immune response, for example, at 1-4 months for a second dose, and if needed, a subsequent dose{s) after 
several months. The dosage regimen will also, at least in part, be determined by tiie need of tiie individual 
and be dependent upon tiie judgment of the practitioner. 

In addition, tiie vaccine containing the immunogenic HCV antigen(s) may be administered in conjunction 
with other immunoregulatory agents, for example, immune globulins. 

40 

II.G. Preparation of Antibodies Against HCV Epitopes 

The immunogenic polypeptides prepared as described above are used to produce antibodies, both 
45 polyclonal and monoclonal. If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit 
goat horse, etc.) is immunized with an immunogenic polypeprtide bearing an HCV epitope(s). Serum from 
tiie immunized animal is collected and treated according to known procedures. If serum containing 
polyclonal antibodies to an HCV epitope contains antibodies to other antigens, the polyclonal antibodies can 
be purified by immunoaffinity chromatography. Techniques for producing and processing polyclonal 
50 antisera are known in the art see for example, Mayer and Walker (1987). 

Altematively, polyclonal antibodies may be isolated from a mammal which has been previously infected 
witfi HCV. An example of a metiiod for purifying antibodies to HCV epitopes from serum from an infected 
individual, based upon affinity chromatography and utilizing a fusion polypeptide of SOD and a polypeptide 
encoded within cDNA clone 5-1-1, is presented in Section V.E. 
55 Monoclonal antibodies directed against HCV epitopes can also be readily produced by one skilled in 
the art The general metiiodology for making monoclonal antibodies by hybridomas is well known. Immortal 
antibody-producing cell lines can be created by cell fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA. or transfection with Epsteln-Barr virus. See, e.g., M. 
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Schreier et al. (1980); Hammerling et al. (1981); Kennett et aL.(1980); also. U.S. Patent Nos. 4.341,761; 
4.399,121; 4.427,783; 4,444.887; 4.466.917; 4.472.500; 4.491.632; and 4,493,890. Panels of monoclonal 
antibodies produced against HCV epitopes can be screened for various properties; i.e., for isotype. epitope 
affinity, etc. 

5 Antibodies, both monoclonal and polyclonal, which are directed against HCV epitopes are particularly 
useful in diagnosis, and those which are neutralizing are useful in passive immunotherapy. Monoclonal 
antibodies, in particular, may be used to raise anti-idiotype antibodies. 

Anti-ldiotype antibodies are immunoglobulins which carry an "intemal image" of the antigen of the 
infectious agent against which protection Is desired. See, for example. Nisonoff. A., et al. (1981) and 
10 Dreesman et al. (1985). 

Techniques for raising anti-idiotype antibodies are known In the art. See, for example, Grzych (1985), 
MacNamara et at. (1984), and Uytdehaag et al. (1985). These anti-idiotype antibodies may also be useful for 
treatment of NANBH. as well as for an elucidation of the immunogenic regions of HCV antigens. 

75 

II.H. Diagnostic Oligonucleotide Probes and Kits 

Using the disclosed portions of the isolated HCV cDNAs as a basis, including those in Figs. 1-32, 
oligomers of approximately 8 nucleotides or more can be prepared, either by excision or synthetically, 

20 which hybridize with the HCV genome and are useful in identification of the viral agent{s), further 
characterization of the viral genome(s), as well as in detection of the virus(es) in diseased individuals. The 
probes for HCV polynucleotides (natural or derived) are a length which allows the detection of unique viral 
sequences by hybridization. While 6-8 nucleotides may be a workable length, sequences of 10-12 
nucleotides are preferred, and about 20 nucleotides appears optimal. Preferably, these sequences will 

25 derive from regions which lack heterogeneity. These probes can be prepared using routine methods^ 
including automated oligonucleotide syntiietic methods. Among useful probes, for example, are the clone 5- 
1-1 and the additional clones disclosed herein, as well as the various oligomers useful in probing cDNA 
libraries, set fortii below. A complement to any unique portion of the HCV genome will be satisfactory. For- 
use as probes, complete complementarity is desirable, though it may be unnecessary as the length of the 

30 fragment is increased. 

For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, is 
treated, if desired, to extract the nucleic acids contained therein. The resulting nucleic acid from the sample 
may be subjected to gel electrophoresis or other size separation techniques; alternatively, the nucleic acid 
sample may be dot blotted without size separation. The probes are then labeled. Suitable lat>eis. and 

35 methods for labeling probes are known in the art, and include, for example, radioactive labels incorporated 
by nick translation or kinasing. biotin, fluorescent probes, and chemiluminescent probes. The nucleic acids 
extracted from the sample are then treated with tiie labeled probe under hybridization conditions of suitable 
stringencies. 

The probes can be made completely complementary to the HCV genome. Therefore, usually high 

40 Stringency conditions are desirable in order to prevent false positives. However, conditions of high 
stringency should only be used if the probes are complementary to regions of the viral genome which lack 
heterogeneity. The stringency of hybridization is determined by a number of factors during hybridization 
^ and during the washing procedure, including temperature, ionic strength, length of time, and conceritration 
of formamide. These factors are outlined in, for example, Maniatis, T. (1 982). 

45 Generally, it Is expected that the HCV genome sequences will be present in serum of infected 
individuals at relatively low levels, i.e., at approximately lO^-IO^ sequences per ml. This level may require 
that amplification techniques be used in hybridization assays. Such techniques are known in the art For 
example, tiie Enzo Biochemical Corporation "Bio-Bridge" system uses terminal deoxynucleotlde transferase 
to add unmodified s'-poly-dT-taiis to a DNA probe. The poly dT-talled probe is hybridized to the target 

50 nucleotide sequence, and tiien to a biotin-modified poly-A, PCT application 84/03520 and EPA1 24221 
describe a DNA hybridization assay in which: (1) analyte is annealed to a single-stranded DNA probe that is 
complementary to an enzyme-labeled oligonucleotide; and (2) the resulting tailed duplex is hybridized to an 
enzyme-labeled oligonucleotide. EPA 204510 descrik)es a DNA hybridization assay in which analyte DNA is 
contacted with a probe that has a tail, such as a poly-dT tail, an amplifier strand that has a sequence that 

55 hybridizes to the tail of the probe, such as a poly-A sequence, and which is capable of binding a plurality of 
labeled strands. A particulariy desirable technique may first involve amplification of the target HCV 
sequences in sera approximately 10.000 fold. i.e., to approximately 10^ sequences/ml. This may be 
accomplished, for example, by the technique of Saiki et al. (1986). The amplified sequence(s) may then be 
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detected using a hybridization assay which is described in co-pending U.S. Application, Attorney Docket 
No. 2300-0171. which was filed 15 Octoi:>er 1987, is assigned to the herein assignee, and is hereby 
incorporated herein by reference. This hybridization assay, which should detect sequences at the level of 
lO^ ml utilizes nucleic acid muitimers which bind to single-stranded analyte nucleic acid, and which also 
bind to a muttipiidty of single-stranded labeled oligonucleotides. A suitable solution phase sandwich assay 
which may be used with labeled polynucleotide probes, and the methods for the preparation of probes is 
described In EPO 225.807, published June 16, 1987. which is assigned to the herein assignee, and which is 
hereby incorporated herein by reference. 

The probes can be packaged into diagnostic kits. Diagnostic kits include the probe DNA, which may be 
labeled; alternatively, the probe DNA may be unlabeled and the ingredients for labeling may be included in 
the kit. The kit may also contain other suitably packaged reagents and materials needed for the particular 
hybridization protocol, for example, standards, as well as Instnictions for conducting the test 



ILL Immunoassay and Diagnostic Kits 

Both the polypeptides which react immunologically with serum containing HCV antibodies, for example, 
those derived from or encoded within the clones described in Section IV.A., and composites thereof, (see 
section IV A) and the antibodies raised against the HCV specific epitopes in these polypeptides, see for 
example Section IV.E, are useful in immunoassays to detect presence of HCV antibodies, or the preserice 
of the virus and/or viral antigens, in biological samples, including for example, blood or serum samples. 
Design of the Immunoassays is subject to a great deal of variation, and a variety of these are known in the 
art. For example, the immunoassay may utilize one viral antigen, for example, a polypeptide derived from 
any of the clones containing HCV cDNA described in Section IV A. or from the composite cDNAs derived 
from the cDNAs in these ctones, or from the HCV genome from which the cDN A in these clones is derived; 
alternatively, the immunoassay may use a combination of viral antigens derived from these sources. It may 
use. for example, a monoclonal antibody directed towards a viral epitope(s), a combination of monoclonal 
antibodies directed towards one viral antigen, monoclonal antibodies directed towards different viral 
antigens, polyclonal antibodies directed towards the same viral antigen, or polyclonal antibodies directed 
towards different viral antigens. Protocols may be based, for example, upon competition, or direct reaction, 
or sandwich type assays. Protocols may also, for example, use solid supports, or may be by im- 
munoprecipifadon. Most assays involve the use of labeled antibody or polypeptide; the labels may be. for 
example, fluorescent, chemifuminescent. radioactive, or dye molecules. Assays which amplify the signals 
from the probe are also known; examples of which are assays which utilize biotin and avidin. and enzyme- 
labeled and mediated immunoassays, such as ELISA assays. 

The Ravivirus model for HCV allows predictions regarding the likely location of diagnostic epitopes for 
the virion structural proteins. The C. pre-M, M. and E domains are all likely to contain epitopes of significant 
potential for detecting viral antigens, and particulariy for diagnosis. Similarly, domains of the nonstructural 
proteins are expected to contain important diagnostic epitopes (e.g.. NS5 encoding a putative polymerase; 
and NS1 encoding a putative complement-binding antigen). Recombinant polypeptides, or viral polypep- 
tides, which include epitopes from these specific domains may be useful for the detection of viral antibodies 
In infections blood donors and Infected patients. 

- In addition, antibodies directed against the E and/or M proteins can be used in immunoassays for the 
detection of viral antigens in patients with HCV caused NANBH, and in infectious blood donors. Moreover, 
these antibodies will be extremely useful in detecting acute-phase donors and patients. 

Kits suitable for immunodtagnosis and containing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, Inckjding the polypeptides of the invention containing HCV epitopes or 
antibodies directed against HCV epitopes in suitable containers, along with the remaining reagents and 
materials required for the conduct of the assay, as well as a suitable set of assay instiictions. 

ILJ. Further Characterization of the. HCV Genome, Virions, and Viral Antigens Usjng Probes Derived From 
cDNA to the Viral Genome 

The HCV cDNA sequence infonnation in the clones described in Section IV.A., as shown In Rgs. 1-32, 
inclusive, may be used to gain further information on the sequence of the HCV genome, and for 
identification and isolation of the HCV agent, and thus will aid in Its characterization including the nature of 
the genome, the structure of the viral particle, and the nature of the antigens of which it is composed. This 
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information, in turn, can lead to additional polynucleotide probes, polypeptides derived from the HCV 
genome, and antibodies directed against HCV epitopes, which would be useful for the diagnosis and/or 
treatment of HCV caused NANBH. 

The cDNA sequence information in the above-mentioned clones is useful for the design of probes for 

5 the isolation of additional cDNA sequences which are derived from as yet undefined regions of the HCV 
genome(s) from which the cDNAs in clones described in Section IVA are derived. For example, labeled 
probes containing a sequence of approximately 8 or more nucleotides, and preferably 20 or more 
nucleotides, which are derived from regions close to the s'-termini or s'-tennini of the family of HCV cDNA 
sequences shown in Rgs. 1, 3, 6. 9, 14 and 32 may be used to isolate overlapping cDNA sequences from 

10 HCV cDNA libraries. These sequences which overlap the cDNAs in the above-mentioned clones, but which 
also contain sequences derived from regions of the genome from which the cDNA in the above mentioned 
clones are not derived, may then be used to synthesize probes for identification of other overlapping 
fragments which do not necessarily overtap the cDNAs in the clones described in Section IV^. Unless the 
HCV genome is segmented and the segments lack common sequences, it is possible to sequence ihe 

15 entire viral genome(s) utilizing the technique of isolation of overlapping cDNAs derived from tfie viral 
genome(s). Although it is unlikely, if the genonne is a segmented genome which lacks common sequences, 
the sequence of the genome can be detennf\ined by serologically screening lambda-gtll HCV cDNA 
libraries, as used to isolate clone 5-1-1, sequencing cDNA isolates, and using the isolated cDNAs to isolate 
overiapping fragments, using the technique described for the isolation and sequencing of the clones 

20 described in Section IVA Alternatively, characterization of the genomic segments could be from the viral 
genome(s) isolated from purified HCV particles. Methods for purifying HCV particles and for detecting them 
during the purification procedure are described herein, infra. Procedures for isolating polynuclekJtide 
genomes from viraf particles are known in the art, and one procedure which may be used is shown in 
EKample IV,A.1. The isolated genomic segments could then be cloned and sequenced. Thus, witii the 

25 infomnation provided herein, it is possible to clone and sequence the HCV genome(s) irrespective of their 
nature. 

Methods for constructing cDNA libraries are known in the art, and are discussed supra and infra; a 
method for the construction of HCV cDNA libraries in lambda-gtll is discussed infra in Section IVA 
However, cDNA libraries which are useful for screening wrtii nucleic acid probes may also be constructed in.. 

30 other vectors l<nown in tiie art, for example. iambda-gtIO (Huynh et al. (1985)). The HCV derived cDNA 
detected by the probes derived from the cDNAs in Rgs. 1-32. and from tiie probes synthesized from 
polynucleotides derived from these cDNAs, may be isolated from the clone by digestion of the isolated 
polynucleotide with the appropriate restriction enzyme(s), and sequenced. See, for example, Section IV.A.3. 
and IV A4. for the techniques used for tiie isolation and sequencing of HCV cDNA which overlaps HCV 

35 cDNA in clone 5-1-1, Sections IV.A.5-IVA7 for the isolation and sequencing of HCV cDNA which overlaps 
tiiat in clone 81 . and Section IV.A.8 and IV A9 for the isolation and sequencing of a clone which overiaps 
another clone (clone 36), which overiaps clone 81. 

The sequence information derived from tiiese overiapping HCV cDNAs is useful for determining areas 
of homology and heterogeneity within the viral genome(s), which could indicate the presence of different 

40 Strains of the genome, and/or of populations of defective particles. It is also useful for the design of 
hybridization probes to detect HCV or HCV antigens or HCV nucleic acids in biological samples, and during 
the isolation of HCV (discussed infra), utilizing the techniques described in Section II.G. Moreover, th^ 
overlapping cDNAs may be used to create expression vectors for polypeptides derived from the HCV 
genome(s) which also encode the polypeptides encoded in clones 5-1-1, 36. 81. 91, and 1-2, and in the 

45 other clones described in Section IVA The techniques for the creation of these polypeptides containing 
HCV epitopes, and for antibodies directed against HCV epitopes contained within tiiem, as well as their 
uses, are analogous to those described for polypeptides derived from NANBV cDNA sequences contained 
witiiin clones 5-1-1, 32, 35, 36, 1-2, 81, and 91 , discussed supra and infra. 

Encoded within the family of cDNA sequences contained within clones 5-1-1, 32. 35, 36, 81. 91, 1-2, 

50 and tiie other clones described in Section IV.A. are antigen(s) containing epitopes which appear tp be 
unique to HCV; i.e., antibodies directed against these antigens are absent from individuals infected witii 
HAV or HBV. and from individuals not infected with HCV (see the serological data presented in Section 
IV.B.). Moreover, a comparison of the sequence information of these cDNAs with the sequences of HAV. 
HBV. HDV, and with tiie genomic sequences in Genebank indicates that minima! homology exists between 

55 these cDNAs and tiie polynucleotide sequences of tiiose sources. Thus, antibodies directed against the 
antigens encoded witiiin the cDNAs of these clones may be used to identify BB-NANBV particles isolated 
from infected Individuals. In addition, they are also useful for the isolation of NANBH agent(s). 

HCV particigs may be isolated from the sera from BB-NANBV infected individuals or from cell cultures 

21 
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by any of the methods known in the art, including for example, techniques based on size discrimination 
such as sedimentation or exclusion methods, or techniques based on density such as ultracentrifugation in 
density gradients, or precipitation with agents such as polyethylene glycol, or chromatography on a variety 
of materials such as anionic or catibnic exchange materials, and materials which bind due to hydrophobic- 

5 ity. as well as affinity columns. During the isolation procedure the presence of HCV may be detected by 
hybridization analysis of the extracted genome, using probes derived from the HCV cDNAs described 
supra, or by immunoassay (see Section ILL) utilizirig as probes antibodies directed against HCV antigens 
encoded within the family of cDNA sequences shown in Rgs, 1-32, and also directed against HCV antigens 
encoded within the overiapping HCV cDNA sequences discussed supra. The antibodies rnay be mon- 

70 oclonal, or polyclonal, and it may be desirable to purify the antibodies before their use in the immunoassay. 
A purification procedure for polyclonal antibodies directed against antigen(s) encoded within clone 5-1-1 is 
described in Section IV.E; analogous purification procedures may be utilized for antibodies directed against 
other HCV antigens. 

Antibodies directed against HCV antigens encoded within the family of cDNAs shown in Rgis. 1-32, as 
75 well as tiiose encoded witiiin overlapping HCV cDNAs, whidi are affixed to solid supports are useful for the 
isolation of HCV by immunoaffinity chromatography. Techniques for immunoaffinity chromatography are 
known in the art including techniques for affixing antibodies to solid supports so that they retain their 
immunoselective activity; the techniques may be those in which the antibodies are adsorbed to the support 
(see. for example, Kurstak iri ENZYME IMMUNODIAGNOSIS. page 31-37), as well as those in which the 
20 antibodies are covalentiy linked to the support. Generally, tiie techniques are similar to those used for 
cbvalent linking of antigens to a solid support, which are generally described in Section ILC; however, 
spacer groups may be included in the bifurictional coupling agents so that the antigen binding site of the 
antibody remains accessible. 

During the purification procedure the presence of HCV may be detected and/or verified by nucleic acid 
25 hybridization, utilizing as probes polynucleotides derived from the family of HCV cDNA sequences shown in 
Figs. 1-32, as well as from overiapping HCV cDNA sequences, described supra. In this case, the fractions 
are treated under conditions which would cause the disruption of viral particles, for example, with detergents 
in the presence of chelating agents, and the presence of viral nucleic acid determined by hybridization 
techniques described in Section ILH. Further confirmation that the isolated particles are. tiie agents which 
30 induce HCV may be obtained by infecting chimpanzees with the isolated virus particles, followed by a 
determination of whether tiie symptoms of NANBH result from the infection. 

Viral particles from the purified preparations may then be further characterized. The genomic nucleic 
acid has been puritied. Based upon its sensitivity to RNase, and not DNase I, it appears that the virus is 
composed of an RNA genome. See Example IV.C.2.. infra. The strandedness and circularity or non- 
35 circularity can determined by techniques known in the art. including, for example, its visualization by 
electron microscopy, its migration in density gradients, and its sedimentation characteristics. Based upon 
the hybridization of the captured HCV genome to the negative strands of HCV cDNAs. it appears that HCV 
may be comprised of a positive stranded RNA genome (see Section IV.H.l). Techniques such as these are 
described in, for example, METHODS IN ENZYMOLOGY. In addition, tiie purified nucleic acid can be 
40 cloned and sequenced by known techniques, including reverse transcription since tiie genomic material is 
RNA. See, for example, Maniatis (1982), and Glover (1985). Utilizing tiie nucleic acid derived from the viral 
particles, it is possible to sequence the entire genome, whether or not it is segmented. 
A Examination of the homology of tiie polypeptide encoded within the continuous ORF of combined 
clones 14i through 39c (see Rg. 26), shows that the HCV polypeptide contains regions of homology with 
45 the corresponding proteins in conserved regions of flaviviruses. An example of this is described in Section 
IV.H.3. This finding has many important ramifications. First, this evidence. In conjunction with the results 
which show tiiat HCV contains a positive-stranded genome, the size of which is approximately 10,000 
nucleotides, is consistent witii the suggestion that HCV is a f lavivirus, or fiavi-like virus. Generally, fiavivirus 
virions and their genomes have a relatively consistent structure and organization, which are known. See 
50 Rice et al. (1986), and Brinton, MA (1988). Thus, the structural genes encoding the polypeptides C, pre- 
M/M. and E may be located in tiie s'-terminus of tiie genome upstream of clone 14i. Moreover, using the 
comparison with other flaviviruses, predictions as to tiie precise location of the sequences encoding these 
proteins can be made. 

isolation of tiie sequences upstream of those in clone 141 may be accomplished in a number of ways 
55 which, given the information herein, would be obvious to one of skill in the art For example, the genome 
"walking" technique, may be used to isolate other sequences which are 5' to those in clone 14i, but which 
overiap that clone; this in turn leads to the isolation of additional sequences. This technique has been amply 
demonstrated infra, in Section IV.A. For example, also, it is known that the flaviviruses have conserved 
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epitopes and regions of conserved nucleic acid sequences. Polynucleotides containing the conserved 
sequences may be used as probes which bind the HCV genome, thus allowing its. isolation. In addition, 
these conserved sequences, in conjunction with those derived from the HCV cDNAs shown In Rg. 22. may 
be used to design primers for use in systems which amplify the genome sequences upstream of those in 

5 clone 141, using polymerase chain reaction technology. An example of this is described infra. 

The structure of the HCV may also be determined and its componerits isolated. The morphology and 
size may be determined by, for exanriple^ electron microscopy. The identification and iocalization of specific 
virat polypeptide antigens such as coat or envelope antigens, or internal antigens, such as nucleic acid 
binding proteins, core antigens, and polynucleotide polymerase(s) may also be detenmined by, for example, 

10 detemnining whetiier the antigens are present as major or minor viral components, as well as by utilizing 
antibodies directed against the specific antigens encoded within isolated cDNAs as probes. This infomnation 
is useful in the design of vaccines; for example, it may be preferable to include an exterior antigen in a 
vaccine preparation. Multivalent vaccines may be comprised of. for example, a polypeptide derived from the 
genome encoding a structural protein, for example, E. as well as a polypeptide from another portion of the 

75 genome, for example, a nonstructural or structural polypeptide. 



U.K. Cetl Culture Systems and Ariimal Model Systerns for HCV Replication 

20 The suggestion that HCV is a flavivirus or flavi-like virus also provides information on methods- for 
growing HCV. The term "flavi-like" means tiiat the virus shows a significant amount of homology to the 
known conserved regions of flaviviruses and that the majority of the genome is a single ORF. Methods for 
culturing flaviviruses are known to tiiose of skill in the art ( See , for example, the reviews by Brinton (1986) 

^ and Stellar, v. (1980)). Generally, suitable cells or cell lines for culturing HCV may include those known to 

25 support Flavivirus replication, for example, the following: monkey kidney cell lines (e.g. MK2. VERO); 
porcine kidney cell lines (e.g. PS); baby hamster kidney cell lines (e.g. BHK); murine macrophage cell lines 
(e.g.. P388D1, MK1, Mml); human macrophage cell lines (e.g., U-937); human peripheral blood leukocytes; 
human adherent monocytes; hepatocytes or hepatocyte cell lines (e.g., HUH7 , HEPG2); embryos or 
embryonic cells (e.g., chick embryo fibroblasts); or cell lines derived from invertebrates, preferably from 

30 insects (e.g. drosophila cell lines), or more preferably from artiiropods, for example, mosquito cell lines 
(e.g., A. Albopictus, Aedes aegypti. Cutex tritaeniorhynchus) or tick cell lines (e.g. RML-14 Dermacentor 
parumapertus). 

ft is possible that primary hepatocytes can be cultured, and tiier> infected witii HCV; or alternatively, the 
hepatocyte cultures could be derived from the livers of infected individuals (e.g.. humans or chimpanzees). 

35 The latter case is an example of a cell which is infected in vivo being passaged in vitro. In addition, various 
immortalization methods can be used to obtain cell-lines' derived from hepatocyte cultures. For example, 
primary liver cultures (before and after enrichment of the hepatocyte population) may be fused to a variety 
of cells to maintain stability. For example, also, cultures may be Infected with transforming viruses, or 
transfected with transforming genes in order to create permanent or semipermanent cell lines. In addition, 

40 for example, cells in liver cultures may be fused to established cell lines (e.g., HepG2 ). Metiiods for cell 
fusion are known in the art, and include, for example, tiie use of fusion agents such as polyethylene glycol, 
Sendai Virus, and Epstein-Barr virus. 

^ As discussed above. HCV is a Ravivirus or Ravi-like virus. Therefore, it is probable tiiat HCV infection 
of cell lines may be accomplished by techniques known in tiie art for infecting cells with Raviviruses. These 

45 include, for example, incubating the cells with viral preparations under conditions which allow viral entry into 
> the cell. In addition, it may be possible to obtain viral production by transfecting the cells with Isolated viral 
polynucleotides. It is known tiiat Togavirus and Ravivirus RNAs are infectious In a variety of vertebrate cell 
lines (Pfeffericom and Shapiro 1974)), and In a mosquito cell line (Peleg (1969)). Metiiods for transfecting 
tissue culture cells with RNA duplexes, positive stranded RNAs, and DMAs (including cDNAs) are known in 

60 tiie art, and include, for example, techniques which use electroporation, and precipitation with DEAE^ 
Dextran or calcium phosphate. An abundant source of HCV RNA can be obtained by performing in vitro 
transcription of an HCV cDNA corresponding to the complete genome. Transfection with this material, or 
wrtii cloned HCV cDNA should result in viral replication and the in vitro propagation of the virus. 

In addition to cultured ceils, animal model systems may be used for viral replication; animal systems in 

55 which flaviviruses are known to those of skill in the art (Siee. for example, the review by Mohatii (1986)). 
Thus. HCV replication may occur not only in chimpanzees, but also in, for example, manmosets and 
suckling mice. 
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ILL. Screening for Anti-Viral Agents for HCV . 

Til© availability of cell culture and animal model systems for HCV also makes possible screening for 
anti-viral agents which inhibit HCV replication, and particularly for those agents which preferentially allow 

5 cell growth and muftiplication while Inhibiting viral replication. These screening methods are; known by those 
of skill in the art Generally* the anti-viral agents are tested at a variety of concentrations, for their effect on 
preventing viral replication in cell culture systems which support viral replication, and tiien for an inhibition 
of infectivity or of viral patinogenicity (and a low level of toxicity) in an animal model system. 

The methods and compositions provided herein for detecting HCV antigens and HCV polynucleotides 

10 are useful for screening of anti-viral agents in that tiiey provide an alternative, and perhaps more sensitive 
means, tor detecting the agent's effect on viral replication than the cell plaque assay or ID50 assay. For 
example, tiie HCV-polyniideotide probes described herein may be used to quantitate tiie amount of viral 
nucleic acid produced in a cell culturis. This could be accomplished, for example, by hybridization or 
competition hybridization of the infected cell nucleic acids with a labeled HCV-polynucleotide probe> For 

75 example, also. anti-HCV antibodies may be used to identify and quantitate HCV antigen(s) in the cell culture 
utilizing the immunoassays described herein. In addition, since it may be desirable to quantitate HCV 
antigens in the infected cell culture by a competition assay, the polypeptides encoded witiiin tiie HCV 
cDNAs described herein are useful in these competition assays. Generally, a recombinant HCV polypeptide 
derived from tiie HCV cDNA would be labeled, and tiie inhibition of binding of this labeled polypeptide to an 

20 HCV polypeptide due to the antigen produced in the cell culture system would be monitored. Moreover, 
tiiese techniques are particularly useful in cases where the HCV may be able to replicate in a cell line 
without causing cell death. 



25 II.M. Preparation of Attenuated Strains of HCV 

In addition to the above, utiiizing tile tissue culture systems and/or animal model systems, it may be 
possible to isolate attenuated strains of HCV. These strains would be suitable for vaccines, or for the 
isolation of viral antigens. Attenuated strains are isolatabie after multiple passages in cell culture and/or an 

30 animal model. Detection of an attenuated sti-an in an infected cell or individual is achievable by techniques 
known in the art and could include, for example, the use of antibodies to one or more epitopes encoded in 
HCV as a probe or tiie use of a polynucleotide containing an HCV sequence of at least about 8 nucleotides 
as a probe. Alternatively, or in addition, an attenuated strain may be constxicted utilizing the genomic 
information of HCV provided herein, and utilizing recombinant techniques. Generally, one would attempt to 

35 delete a region of the genome encoding, for example, a polypeptide related to patiiogenicity, but which 
allows viral replication. In addition, the genome construction would allow the expression of an epitope which 
gives rise to neutralizing antibodies for HCV. The altered genome could tiien be utilized to transform cells 
which allow HCV replication, and tiie cells grown under conditions to allow viral replication. Attenuated HCV 
strains are useful not only for vaccine purposes, but also as sources for tiie commercial production of viral 

40 antigens, since tiie processing of these vimses would require less stringent protection measures for the 
erhployees involved in viral production and/or the production of viral products. 



III. General Methods • 

45 

The general techniques used in extracting the genome from a virus, preparing and probing a oDWK 
library, sequencing clones, constructing expression vectors, transfonming cells, performing immunological 
assays such as radioimmunoassays and ELISA assays, for growing cells in culture, and tiie like are known 
in the art and laboratory manuals are available describing these techniques. However, as a general guide, 
50 tiie following sets forth some sources currentiy available for such procedures, and for materials useful in 
canryirig them out 



III.A. Hosts and Exjpressibh Control Sequences 

Both prdkaryotic and eukaryotic host cells may be used for expression of desired coding sequences 
when appropriate control sequences which are compatible with tiie designated host are used. Among 
prokaryotic hosts, E cofi is most frequentiy used. Expression control sequences for prokaryotes include 
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promoters, optionally containing operator portions, and ribosome binding sites. Transfer vectors conr^patible 
with prokaryotic hosts are commonly derived from, for example, pBR322. a plasmid containing operons 
conferring ampicillin and tetracycline resfistanee. and the various pUC vectors, which also contain se- 
quences conferring antibiotic resistance markers. These markers may be used to obtain successful 

5 transformants by selection. Commonly used prokaryotic control sequences include the Beta-lactamase 
(penicillinase) and lactose promoter systems (Chang et a!., (1977)), the tryptophan (trp) promoter system 
(Goeddel et al. (1980)) and the lambda-derived Pl promoter and N gene ribosome binding site (Shimatake 
et al. (1981)) and the hybrid tac promoter (De Boer et al. (1983)) derived from sequences of the trp and lac 
UV5 promoters. The foregoing systems are particularly compatible with E. coH; if desired, other prokaryotic 

70 hosts such as strains of Bacillus or Pseudomonas may be used, with corresponding control sequences. 

Eukaryotic hosts include yeast and mammalian cells in culture systems- Saccharomyces cerevisiae and 
Saccharomyces carisbergensis are the most commonly used yeast hosts, and are convenient fungal hosts. 
Yeast compatible vectors carry markers which permit selection of successful transformants by conferring 
proto^ophy to auxotrophic mutants or resistance to heavy metals on wild-type strains. Yeast compatible 

75 vectors may employ the 2 micron origin of replication (Broach et al. (1983)). the combination of CEN3 and 
ARS1 or other means for assuring replication, such as sequences vrtiich will result in incorporation of an 
appropriate fragment into the host cell genome. Control sequences for yeast vectors are known in the art 
and include promoters for the synthesis of glycolytic enzymes (Hess et al. (1968); Holland et al. (1976)), 
including the promoter for 3 phosphoglycerate kinase (Hitzeman (1980)). Terminators may also be included, 

20 such as those derived from the enolase gene (Holland (1981)). Particularly useful control systems are those 
which comprise the giyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter or alcohol de- 
hydrogenase (ADH) regulatabie promoter, terminators also derived from GAPDH. and if secretion is desired, 
leader sequence from yeast alpha factor. In addition, the transcriptional regulatory region and the transcrip- 
tional initiation region which are operably linked may be such that they are not naturally associated in the 

25 wild-type organism. These systems are described in detail in EPO 120,551, published October 3. 1984; 
EPO 116,201. published August 22, 1984; and EPO 164,556. published December 18. 1985. alt of which are 
assigned to the herein assignee, and are hereby incorporated herein by reference. 

Mammalian cell lines available as hosts for expression are known in the art and include R)any 
immortalized cell lines available from the American Type Culture Collection (ATCC), including HeLa cells, 

30 Chinese hamster ovary (CHO) cells, baby hamster kidney (BHK) cells, and a number of other cell lines. 
Suitable promoters for mammalian cells are also known in the art and include viral promoters such as that 
from Simian Virus 40 (SV40) (Rers (1978)). Rous sarcoma virus (RSV). adenovirus (ADV)^ and bovine 
papilloma virus (BPV). Mammalian cells may also require terminator sequences and poly A addition 
sequences; enhancer sequences which increase expression may also be included, and sequences which 

35 cause amplification of the gene may also be desirable. These sequences are known in the art Vectors 
suitable for replication in mammalian ceils may include viral replicons? or sequences which insure 
integration of the appropriate sequences encoding NANBV epitopes into the host genome. 

40 IILB. Transformations 

Transformation may be by any known method for introducing polynucleotides into a host ceil, including, 
for example packaging the polynucleotide in a virus and transducing a host cell with the virus, and by direct 
uptake of the polynucleotide. The transformation procedure used depends upon the host to be transformed. 

45 For example, transformation of the E. coli host cells with lambda^gtl 1 containing BB-NANBV sequences is 
discussed in the Example section, infra. Bacterial transformation by direct uptake generally employs 
treatment with calcium or rubidium chloride (Cohen (1972); Maniatis (1982)). Yeast transformation by direct 
uptake may be canied out using the method of Hinnen et al. (1978). Mammalian transformations by direct 
uptake may be conducted using the calcium phosphate predpitation method of Graham and Van der Eb 

50 (1978), or the various known modifications thereof. * 



III.C. Vector Construction 

55 Vector construction employs techniques which are known in the art. Site-specific DNA cleavage is 
performed by treating with suitable restriction enzymes under conditions which generally are specified by 
the manufacturer of these commercially available enzymes. In general, about 1 microgram of plasmid or 
DNA sequence is cleaved by 1 unit of enzyme In about 20 microliters buffer solution by incubation of 1-2 hr 
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at 37° C. After incubation with the restriction enzyme, protein is removed by phenol/chloroform extraction 
and the DNA recovered by precipitation with ethanol. The cleaved fragments may be separated using 
poiyacrylamide or agarose gel electrophoresis techniques, according to the general procedures found in 
Methods in Enzyrhology (1980) 65:489-560. 

5 Sticky ended cleavage fragments rnay be blunt ended using E. coH DNA polymerase I (Klehow) in the 
presence of the appropriate deoxynucieotide triphosphates (dNTPs) present in the miixture. Treatment with 
Si nuclease may also be used, resulting in the hydrolysis of any single stranded DNA portions. 

Ligations are carried out using standard buffer and temperature conditions using T4 DNA tigase and 
ATP; sticky end ligations require less ATP and leiss ligase than blunt end ligations. When vector fragments 

10 are used as part of a ligation mixturia, the vector fragment is often treated with bacterial alkaline 
phosphatase (BAP) or calf intestinal alkaline phosphatase to remove the s'-phosphate and thus prevent 
religation of the vector; alternatively, restrictidh enzyme digestion of unwanted fragments can be used to 
prevent iigatiori. 

Ligation mixtures are transformed into suitable clbhihg hosts, such as E. coli. and successful transfor- 
75 mants selected by, for example, antibiotic resistance, and screened for the con-ect construction. 



1II.D. Construction of Desired DNA Sequences 

20 Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as de- 
scribed by Warner (1984). If desired the synthetic strands may be labeled with by treatment with 
polynucleotide kinase in the presence of ^P-ATP. using standard conditions for the reaction. 

DNA sequences. irK^luding those isolated from cDNA tibraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as described by Zoller (1982), Briefly, the DNA to be 

25 modified is packaged into phage as a single stranded sequence, and converted to a double stranded DNA 
with DNA polymerase using, as a primer, a synthetic oligonucleotide complementary to the portion of the 
DNA to be modified, and having the desired modification included in its own sequence. The resulting 
double stranded DNA is transformed into a phage supporting host bacterium. Cultures of the transfonned 
bacteria, which contain replications of each strand of the phage, are plated in agar to obtain plaques. 

30 Theoretically, 50% of the new plaques contain phage having the mutated sequence, and the remaining 50% 
have the original sequence. Replicates of the plaques are hybridized to labeled synthetic probe at 
temperatures and conditions which permit hybridization with the correct strand, but not with tiie unmodified 
sequence. The sequences which have been identified by hybridization are recovered and cloned. 

35 

I1I.E. Hybridization with Probe 

DNA libraries may be probed using the procedure of Grunstein and Hogness (1975). Briefly, in this 
procedure, the DNA to be probed is immobilized on nitrocellulose filters, denatured, and prehybridized with 

40 a buffer containing 0-50% formamide. 0.75 M NaCI, 75 mM Na citrate, 0.02% (wt/v) each of bovine serum 
albumin, polyvinyl pyrollidone, and Rcoll, 50 mM Na Phosphate (pH 6.5). 0.1% SDS. and 100 
micrograms/ml carrier denatured DNA. The percentage of formamide in the buffer, as well as the time and 
^temperature conditions of the prehybridization and subsequent hybridization steps depends on the strin- 
gency required. Ofigomeric probes which require lower stringency conditions are generally used with low 

45 percentages of formamide. lower temperatures, and longer hybridization times. Probes containing more than 
30 or 40 nucleotides such as tho^ derived from cDNA or genomic sequences generally employ higher 
temperatures, e.g., about 40-42* C, and a high percentage. e.g., 50%, fom^amide. Following prehybridiza- 
tion, 5'-32p.|abeled oligonucleotide probe is added to the buffer, and the filters are incubated in this mixture 
under hybridization conditions. After washing, the treated filters are subjected to autoradiography to show 

so the location of the hybridized probe; DNA in corresponding locations on the original agar plates is used as 
the source of the desired DNA. 



in,F. Verification of Construction and Sequencing 

For routine vector cohstructidhs. ligation^mixtures are transformed Into E. coli strain HB101 or other 
suitable host, and successful transformants selected by antibiotic resistance or other markers. Piasmids 
frorti the tiransfomiants are then prepared according to the nriethod of Clewell et at. (1969), usually following 
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chloramphenico! amplification (Clewed (1972)). The DMA is isolated and analyzed, usually by restriction 
enzyme analysis and/or sequencing. Sequencing may be by iUe dideoxy method of Sanger et al. (1977) as 
further described by Messing et aL (1981). or by the method of Maxam et ai. (1980). Problems with band 
compression, which are sometimes observed in GC rich regions, were overcome by use of T- 
5 deazoguanqsine according to Barr et al. (1986). 



III.G. Enzymes Linked Immunosorbent Assay 

70 The enzyme-linked immunosorbent assay (EUSA) can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of an enzyme to erttier an antigen or an antibody, 
and uses the bound enzyme activity as a quantitative label. To measure antibody, the known antigen is 
fixed to a solid phase (e.g., a mrcropiate or plastic cup), Incubated with test serum dilutions, washed, 
incubated with anti-immunoglobulln labeled with an enzyme, and washed again. Enzymes suitable for 

15 labeling are known In the art. and include, for example, horseradish peroxidase. Enzyme activity bound to 
the solid phase is measured by adding the specific substrate, and determining product formation or 
substrate utilization coiorimetrically. The enzyme activity bound is a direct function of the amount of 
antibody bound. 

To measure antigen, a known specific antibody is fixed tb tiie solid phase, the test material containing 
20 antigen is added, after an Incubation the solid phase is washed, and a second enzyme^labeled antibody is 
added. After wasliing, substrate is added, and enzyme activity is estimated coiorimetrically. and related to 
antigen concentration. 



25 IV. Examples 

Described below are examples of the present invention which are provided only for illustrative purposes, 
and not to limit tiie scope of the present invention. In light of the present disclosure, numerous 
embodiments within the scope of the claims will be apparent to those of ordinary skill in the art. The 
30 procedures set forth, for example, in Sections IV.A. may, if desired, be repeated but need not be, as 
techniques are available for construction of the desired nucleotide sequences based on the information 
provided by the invention. Expression Is exemplified in E. coli; however, other systems are available as set 
fortii more fully in Section IIIA Additional epitopes derived from the genomic structure may also be 
produced, and used to generate antibodies as set forth below. 

35 

NA. Preparation, Isolation and Sequencing of HGV cDNA 

40 1V.A.1. Preparation of HGV cDNA 

The source of NANB agent was a plasma pool derived from a chimpanzee witii chronic NANBH. The 
A, chimpanzee had been experimentally infected with blood from another chimpanzee with chronic NANBH 
resulting from infection with HGV in a contaminated batch of factor 8 concentrate derived from pooled 

45 human sera. The chimpanzee plasma pool was made by combining many individual plasma samples 
containing high levels of alanine aminotransferase activity; tills activity results from hepatic injury due to tiie 
HGV infection. Since 1 ml of a 10~« dilution of tiiis pooled serum given i.v. caused NANBH in anotiier 
chimpanzee, its CID was at least 10^ /ml, i.e., it had a high Infectious virus titer. 

A cDNA library from the high titer plasma pool was generated as follows. Rrst, viral particles were 

so isolated from the plasma; a 90, ml aliquot was diluted with 310 ml of a solution containing 50 mM Tris-HGI, 
pH 8.0, ImM EDTA, 100 mM NaGI. Debris was removed by centrifugation for 20 min at 15,000 x g at 20* C. 
Viral particles in the resulting supernatant were then pelleted by centrifugation in a Beckman SW28 rotor at 
28.000 rpm for 5 hours at 20* C. To release the viral genome, tine particles were disrupted by suspending 
tiie pellets in 15 ml solution containing 1% sodium dodecyl sulfate (SDS), 10 mM EDTA. 10 mM Tris-HGI. 

56 pH 7,5, also containing 2 mg/ml proteinase k, followed by incubation at 45*C for 90 min^ Nucleic acids 
were isolated by adding 0.8 micrograms MS2 bacteriophage RNA as carrier, and extracting the mixture four 
times with a 1:1 mixture of phenolxhloroform (phenol saturated with 0.5M Tris-HGI, pH 7.5. 0.1% (vA^) beta- 
mercaptoethanot, 0.1% (wA^) hydroxyquinoione. followed by extraction two times witti chtorofonm. The 
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aqueous phase was concentrated with 1-butan6l prior to precipitation with 2.5 volumes absolute ethanol 
overnight at -20 'C. Nucleic acid was recovered by centrifugation in a Beckman SW41 rotor at 40.000 rpm 
for 90 min at 4" C. and dissolved in water that had been treated with 0.05% (v/v) diethylpyrocarbonate and 
autoclaved. 

5 Nucleic acid obtained by the above procedure (<2 microgi-ams) was denatured with 17.5 nnM 
CHsHgOH; cDNA was synthesized using this denatured nucleic acid as template, and was cloned into the 
EcoRI site of phage Iambda-gt11 using methods described by Huynh (1985), except that random primers 
replaced oligo(dT) 12-18 during the synthesis of the first cDNA strand by reverse transcriptase (Taylor et aL 
(1976)). The resulting double stranded cDNAs were fractionated according to size on a Sepharose CL-4B 

10 column; eluted material of approximate mean size 400, 300. 200. and 100 base-pairs were pooled into 
cDNA pools 1, 2, 3, and 4, respectively. The lambda-gtll cDNA library was generated from the cDNA in 
poo! 3. 

The lambda-gtll cDNA library generated from pool 3 was screened for epitopes that could bind 
specifically with serum derived from a patient who had previously experienced NANBH. About 10^ phage 

75 were screened with patient sera using the methods of Huynh et al. (1985). except that bound human 
antibody was detected witii sheep anti-human !g antisera that had been radio-labeled with '^^l Five positive 
phages were identified and purified. The five positive phages were then tested for specificity of binding to 
sera from 8 different humans previously infected witii the NANBH agent using the same method. Four of 
the phage encoded a polypeptide that reacted immunologically with only one human serum, i.e.. the one 

20 that was used for primary screening of the phage library. The fifth phage (5-1-1) encoded a polypeptide that 
reacted immunologically witii 5 of 8 of tiie sera tested. Moreover, tills polypeptide did not react im- 
munologically witii sera from 7 normal blood donors. Therefore, it appears that clone 5-1-1 encodes a 
polypeptide which is specifically recognized immunologically by sera from NANB patients. 

25 

IV A.2. Sequences of the HCV cDNA in Recombinant Phage 5-1-1. and of the Polypeptide Encoded Within 
the Sequence. 

The cDNA in recombinant phagp 5-1-1 was sequenced by the method of Sanger et al. (1977). 
30 Essentially, the cDNA was excised witii EcoRI. isolated by size fractionation using gel electrophoresis. The 
EcoRI restriction fragments were subcioned into tiie Ml 3 vectors. mp18 and mp19 (Messing (1983)) and 
sequenced using the dideoxychain termination method of Sanger et al. (1977), The sequence obtained is 
shown in Rg. 1. 

The polypeptide encoded in Rg. 1 that is encoded in tiie HCV cDNA is in the same transiational frame 
35 as the N-terminai beta-galactosidase moiety to which it is fused. As shown in Section IV A., the transiational 
open reading frame (ORF) of 5-1-1 encodes epitope(s) specifically recognized by sera from patients and 
chimpanzees with NANBH infections. 

40 1VA.3. Isolation of Overiapping HCV cDNA to cDNA in Clone 5-1-1. 

Overlapping HCV cDNA to the cDNA in clone 5-1-1 was obtained by screening the same lambda-gtll 
- library, created as described in Section IV.A.I., with a syntiietic polynucleotide derived from tiie sequence of 
the HCV cDNA in clones 5-1-1, as shown in Rg. 1. The sequence of tiie polynucleotide used for screening 
45 was: 

S'-TCC CTT GCT CGA TGT ACQ GTA A(3T GOT GAG AGO ACT CTT CCA TCT CAT OGA ACT CTC GGT 
AGA GGA CTT CCC TGT CAG GT-3 . 

5Q The lambda-gtll library was screened with this probe, using tiie metiiod described in Huynh (1985). 
Approximately 1 in 50,000 clones hybridized witii the probe. Three clones which contained cDNAs which 
hybridized witii the syntiietic probe have been numbered 81, 1-2, and 91. 

55 IVA4. Nucleotide Sequences of Overlapping HCV cDNAs to cDNA in Clone 5-1-1. 

The nucleotide sequences of tiie ttiree cDNAs in clones 81, 1-2, and 91 were determined essentially as 
in Section The sequences of these clones relative to the HCV cDNA sequence in phage 5-1-1 is 
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shown in Rg. 2, which shows the strand encoding the detected HCV epitope, and where the homologies in 
the nucleotide sequences are indicated by vertical lines between the sequences. 

The sequences of the cloned HCV cDNAs are highly homologous in the overlapping regions (see Rg. 
2). However, there are differences in two regions. Nucleotide 67 in clone t-2 is a thymidine, whereas the 
5 other three clones contain a cytidine residue in this position. It should be noted, however, that the same 
amino acid is encoded when either C or T occupies this position. 

The second difference is that clone 5-1-1 contains 28. base , pairs which are not present in the other 
three clones. These base pairs occur at the start of the cDNA sequence in 5-1-1, and are indicated by small 
letters. Based on radioimmunoassay data, which is discussed infra in Section IV.D., it is possible that an 
7a HCV epitope may be encoded in this 28 bp region. 

The absence of the 28 base pairs of 5-1-1 from clones 81, 1-2. and 91 may mean that the cDNA in 
these clones were derived from defective HCV genomes; alternatively, the 28 bp region could be a terminal 
artifact in clone 5-1-1 ; 

The sequences of small letters in the nucleotide sequence of clones 81 and 91 simply indicate that 
7? these sequences have not been found in other cDNAs because cDNAs overlapping these regions were not 
yet isolated. 

A composite HCV cDNA sequence derived from overlapping cDNAs in clones 5-1-1. 81, 1-2 and 91 is 
shown in Rg. 3. However, in this figure the unique 28 base pairs of clone 5-1-1 are omitted: The figure also 
shows the sequence of the polypeptide encoded within the ORF of the composite HCV cDNA. 

20 

IV A5. Isolation of Overlapping HCV cDNAs to cDNA in Clone 81. 

The isolation of HCV cDNA sequences upstream of, and which overlap those in clone 81 cDNA was 
25 accomplished as follows. The lambda-gtll cDNA library prepared as described in Section IVJ\.1. was 
screened by hybridization with a synthetic polynucleotide probe which was homologous to a 5'terminal 
sequence of clone 81. The sequence of clone 81 is presented in Rg. 4. The sequence of the synthetic 
polynucleotide used for screening was: 

30 5' CTQ TCA GQT ATG ATT GCC GGC TTC CCQ GAC 3'. • 

The methods were essentially as described in Huynh (1985). except that the library filters "were given two 
washes under stringent conditions, i.e., the washes were in 5 x SSC, 0.1% SDS at 55* C for 30 minutes 
each. Approximately 1 in 50.000 clones hybridized with the probe. A positive recombinant phage* which 
35 contained cDNA which hybridized with the sequence was isolated and purified. This phage has been 
numbered clone 36. 

Downstream cDNA sequences, which overiaps the carboxyl-end sequences in clone 81 cDNA were 
isolated using a procedure similar to that for the isolation of upstream cDNA sequences, except that a 
synthetic oligonucleotide probe was prepared which is homologous to a s' terminal sequence of clone 81 . 
40 The sequence of the synthetic polynucleotide used for screening was: 

5 TTT GGC TAG TGG TTA GTQ GGC TGG TGA CAG 3 

A positive recombinant phage, which contained cDNA which hybridized with this latter sequence was 
45 isolated and purified, and has been numbered clone 32. 



IVA.6. Nucleotide Sequence of HCV cDNA in Clone 36. 

so The nucleotide sequence of the cDNA in clone 36 was determined essentially as described in Section 

IV.A.2. The double-stranded sequence of this cDNA, its region of overiap with the HCV cDNA in clone 81, 

and the polypeptide encoded by the ORF are shown in Rg. 5. 

The ORF in clone 36 is in the same translational frame as the HCV antigen encoded in clone 81. Thus, 

in combination, the ORFs in clones 36 and 81 encode a polypeptide that represents part of a large HCV 
55 antigen. The sequence of this putative HCV polypeptide and the double stranded DNA sequence encoding 

it. which is derived from the combined ORFs of the HCV cDNAs of clones 36 and 81. is shown in Rg. 6. 
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NAJ Nucleotide Sequences of HCV cDNA in Clone 32 

the nucleotide sequence of the cDNA in clone 32 was determined essentially as was that described in 
Section IV A2 for the sequence of clone 5-1-1. The sequence data indicated that the cDNA in clone 32 

5 recombinant phage was derived from two different sources. One fragment of the cDNA was connprised of 
418 nucleotides derived from the HCV genome; the other fragment was comprised of 172 nucleotides 
derived from the bacteriophage MS2 genome, which had been used as a carrier during the preparation of 
the lambda gtll plasma cDNA library. 

The sequence of the cDNA in clone 32 corresponding to that of the HCV genome is shown in Rg. 7. 

70 The region of the sequences that overlaps that of clone 81, and the polypeptide encoded by the ORF are 
also indicated in the figure, this sequence contains one 'continuous ORF that is in the same translationat 
frame as the HCV antigen encoded by clone 81. 



15 IVA.8 Isolation of Overlapping HCV cDNA to cDNA in Clone 36 

The isolation of HCV cDNA sequences upstiream of, and which overiap those in clone 36 cDNA was 
accomplished as described in Section !V.A.5» for those which overlap clone 81 cDNA, except that the 
synthetic polynucleotide was based on the 5 -region of clone 36. The sequence of the synthetic poiy- 
20 nucleotide used for screening was: 

5' AAG CCA CCG TGT GCG CTA GGG CTC AAG CCC 3 

Approximately 1 in 50.000 clones hybridized with the probe. The isolated, purified clone of recombinant 
25 phage which contained cDNA which hybridized to tills sequence was named clone 35. 



IV^.9 Nucleotide Sequence of HCV cDNA in Clone 35 

30 The nucleotide sequence of the cDNA in clone 35 was determined essentially as described in Section 

IVA.2. The sequence, its region of overiap with that of the cDNA in clone 36, and the putative polypeptide 

encoded therein, are shown in Fig. 8. 

Clone 35 apparently contains a single, continuous ORF tiiat encodes a polypeptide in the same 

trahslational frame as tiiat encoded by clone 36, clone 81. and clone 32. Fig. 9 shows the sequence of the 
35 long continuous ORF that extends through clones 35, 36, 81, and 32, along with the putative HCV 

polypeptide encoded tiierein. This combined sequence has been confirmed using other independent cDNA 

clones derived from tiie same lambda gtl 1 cDNA library. 

IV-A.10. Isolation of Overiapping HCV cDNA to cDNA iii Clone 35 

The Isolation of HCV cDIMA sequences upstream of, and which overiap those in done 35 cDNA was 
accomplished as described In Section IVA,8. for those which overlap clone 36 cDNA. except that the 
synthetic polynucleotide was based oin the 5 -region of clone 35. The sequence of the synthetic poly- 
45 nucleotide used for screening was: 

5' CAG GAT GCT GTC tCC CGC ACT CAA C6T 3' 

Approximately 1 in 50,000 clones hybridized with the probe. The isolated, purified clone of recombinant 
50 phage which contained cDNA which hybridized to this sequence was named clone 37b. 



tVA.11. Nucleotide Sequence of HCV in Clone 37b 

65 The nucleotide sequence of the cDNA in clone 37b was determined essentially as described in Section 
IV A2. The sequence, its region of overiap witii tiiat of the cDNA in clone 35, and the putative polypeptide 
encoded therein, are shown in Rg. 10. 

The 5'-tenminal nucleotide of clone 35 is a T, whereas tiie con^esponding nucleotide in clone 37b is an 
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A. The cDNAs from three other independent clones which were isolated during the procedure in which 
clone 37b was isolated, described in Section IVAlO, Have also been sequenced. The cDNAs from these 
clones also contdn an A in this position. Thus, the s'-terminal T in clone 35 may be an artefact of the 
cloning procedure. It is known that artefacts often arise at the s'^termini of cDNA molecules. 
5 Clone 37b apparently contains one continuous ORF which encodes a polypeptide which is a continu- 
ation of the polypeptide encoded in the ORF which extends through the overiapping clones 35.. 36. 81 and 
32. 



TO IVJV.12 Isolation of Overiapping HCV cDNA to cDNA in Clone 32 

The isolafion of HCV cDNA sequences downstream of clone 32 was accomplished as follows. Rrst, 
clone cla was isolated utilizing a synthetic hybridization probe which was based on the nucleotide sequence 
of the HCV cDNA sequence in clone 32. The method was essentially that described in Section IV.A.5. 
75 except that the sequence of the synthetic probe was: . 

5 AGT GCA GTG GAT GAA CCG GOT GAT AGC CTT 3^ 



Utilizing the nucleotide sequence from clone cla. another synthetic nucleotide was synthesized which 
20 had tiie sequence: 

S' TCC TGA GGC GAC TGC ACC AGT GGA TAA GCT 3'. 

Screening of the lambda gtl 1 library using the clone cla derived sequence as probe yielded approximately 
25 1 in 50,000 positive colonies. An isolated, purified clone which hybridized with this probe was named clone 
33b. 



IVA.13 Nucleotide Sequence of HCV cDNA in Clone 33b 

30 

The nucleotide sequence of the cDNA in clone 33b was determined essentially as described in Section 
IV.A.2; The sequence, its region of overiap with that of the cDNA in clone 32, and the putative polypeptide 
encoded therein, are shown in Rg. 11. 

Clone 33b apparentiy contains one continuous ORF which is an extension of the ORFs in overlapping 
35 clones 37b. 35. 36. 81 and 32. The polypeptide encoded in clone 33b is in the same translational frame as 
that encoded in the extended ORF of these overlapping clones. 



iVAl4 isolation of Overlapping HCV cDNAs to cDNA Clone 37b and to cDNA in Clone 33b 

40 

In order to isolate HCV cDNAs which overlap the cDNAs in clone 37b and in clone 33b, the following 
syntiietic oligonucleotide probes, which were derived from tiie cDNAs in those clones, were used to screen 
the lambda gtll library, using essentially the method described in Section IV.A.3. The probes used were: 

45 5 CAG GAT GCT GTG TCC CGC ACT CAA CGT C 3' 
and 

5' TCC TGA GGC GAC TGC ACC AGT GGA TAA GCT 3' 

to detect colonies containing HCV cDNA sequences which overlap those in clones 37b and 33b, 
so respectively; Approximately 1 in 50.000 colonies were detected with each probe. A clone which contained 
cDNA which w£is upstreahfi of, and which overiappeid the cDNA in clone 37b, was named clone 4db. A clone 
which contained cDNA which was downstream of. and which overtapped the cDNA in clone 33b was named 
clone 25c. 

55 

IVA.15 Nucleotide Sequences of HCV cPNA in clone 40b and In clone 25c 

The nucleotide sequences of tine cDNAs in clone 40b and in clone 25c were detennined essentially as 
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described in Section IVJV.2. The sequences of 40b and 25c. tiieir regions of overlap wHh the cDNAs in 
ciones 37b and 33b, and the putative polypeptides encoded therein, are shown in Rg. 12 (clone 40b) and 
Rg. 13 (clone 25c). 

The s'-terminal nucleotide of clone 40b is a G, However, the cDNAs from five other Independent clones 
5 which were isolated during the procedure in which clone 40b was isolated, described in Section IVA14, 
have also been sequenced. The cDNAs from these clones also contain a T in this position. Thus, the G may 
represent a cloning artifact (see the discussion in Section IV.A.11). 

The 5 -terminus of clone 25c is ACT, but the sequence of this region in clone cla (sequence not 
shown), and in clone 33b is TCA, This difference may also represent a cloning artifact, asf may the 28 extra 
10 5 -tenminal nucleotides in clone 5-1-1. 

Ciones 40b and 25c each apparentiy contain an ORF which is an extension of the continuous ORF in 
the previously sequenced ciones. The nucleotide sequence of the ORF extending through clones 40b, 37b. 
35, 36. 81 . 32, 33b, and 25c. and the amino acid sequence of the putative polypeptide encoded therein, are 
shown in Rg. 14. In the figure, the potential artifacts have been omitted fronr^ the sequence, and instead, the 
75 corresponding sequences in non-5 -terminal regions of multiple overlapping clones are shown. . 



IVA16. Preparation of a Composite HCV cDNA from the cDNAs in Clones 36. 81. and 32 

20 The composite HCV cDNA, CI 00. was constructed as follows. Rrst tiie cDNAs from the clones 36, 81, 
and 32 were excised with EcoRL The EcoRI fragment of cDNA from each clone was cloned individually Into 
the EcoRI site of tiie vector pGEM3-blue (Promega Biotec). The resulting recombinant vectors which 
contained the cDNAs from clones 36. 81, and 32 were named pGEM3-blue/36. pGEM3-blue/81. and 
pGEM3-bIue/32. respectively. The appropriately oriented recombinant of pGEM3-bIue/81 was digested witii 

25 Nael and Nart, and the large (-2850bp) fragment was purified and ligated vwth the small (-570bp) Nael/Narl 
purified restriction fragment from pGEWI3-blue/36. This composite of the cDNAs from ciones 36 and 81 was 
used to generate another pGEWI3-blue vector containing the continuous HCV ORF contained within tiie 
overlapping cDNA within tiiese clones. This new piasmid was tiien digested with Pvull and EcoRI to release 
a fragment of approximately 680bp. which was then ligated witii the srhall (580bp) Pvull/EcoRI fragment 

30 isolated from the appropriately oriented pGEM3-blue/32 plasmid. and .the composite cDNA from clones 36, 
81. and 32 was ligated into the EcoRI linearized vector pSODcfl, which is described in Section IV.B.1, and 
which was used to express clone 5-1-1 in bacteria. Recombinants containing the -1270bp EcoRi fragment of 
composite HCV cDNA (Cioo) were selected, and the cDNA from tiie plasmids was excised with EcoRI and 
purified. 

35 

tV.A.17. Isolation and Nucleotide Sequences of HCV cDNAs in Clones 14i, 11b, 7f, 7e. 8h, 33c, 14c. 8f, 33f, 
33g, and 39c 

40 The HCV cDNAs in clones 14i. lib. 7f. 7e. 8h. 33c, 14c, 8f. 33f. 33g, and 39c were isolated by tiie 
technique of isolating overtyping cDNA fragments from the lambda gtl 1 library of HCV cDNAs described 
in Section IVA1.. The technique used was essentially as described in Section IV.A.3.. except that the 
probes used were designed from Uie nucleotide sequence of the last isolated clones from tiie 5 and the 3' 
end of the combined HCV sequences. The frequency of clones which hybridized with tiie probes described 

45 below was approximately 1 in 50.000 in each case. 

The nucleotide sequences of the HCV cDNAs in clones 14i, 7f, 7e, 8h, 33c, 14c, 8f, 33f. 33g, and 39c 
were determined essentially as described In Section IVA.2.. except tiiat tiie cDNA excised from these 
phages were substituted for the cDNA isolated from clone 5-1-1. 

Clone 33c was isolated using a hybridization probe based on the sequence of nucleotides in clone 40b. 

50 The nucleotide sequence of clone 40b is presented in Rg. 12. The nucleotide sequence of the probe used 
to isolate 33c was: 

5' ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT 3 

55 The sequence of tiie HCV cDNA in clone 33c, and the overiap with tiiat in clone 40b. is shown in Rg. 15, 
which also shows the amino acids encoded therein. 

Clone 8h was isolated using a probe based on the sequence of nucleotides in clone 33c. The 
nucleotide sequence of tiie probe was 
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5 AGA GAG AAC CAT GAG GTC CCC GGT GTT-C 3'. 

The sequence of the HCV cDNA in clone 8h, and the overlap wrth that in clone* 33c» and the amino acids 
5 encoded therein, are shown in Rg. 16. 

Clone 7e was Isolated using a probe based on the sequence of nucleotides in clone 8h. The nucleotide 
sequence of the probe was 

5' TCG GAC CTT TAC CTG GTC ACG AGG CAC 3'. 

The sequence of HCV cDNA in clone 7e, tiie overlap with clone 8h, and the amino acids encoded therein, 
are shown in Fig. 17. 

Clone 14c was isolated with a probe based on the sequence of nucleotides in clone 25c. The sequence 
of clone 25c is shown in Rg. 13. The probe In the isolation of clone 14c had the sequence 

IS 

5 ACC TTC CCC ATt AAT GCC TAC ACC ACG GGC 3. 

The sequence of HCV cDNA in clone 14c, its overlap with that in clone 25c. and the amino acids encoded 
therein are shown In Rg. 18. 
20 Clone 8f was isolated using a probe t>ased on the sequence of nucleotides in clone 14c. The nucleotide 
sequence of the probe was 

5' TCC ATC TCT CAA GGC AAC TTG CaC CGC TAA 3'. 

26 The sequence of HCV cDNA in clone 8f, its overlap with that in clone 14c, and the amino acids encoded 
therein are shown in Rg. 19. 

Clone 33f was isolated using a probe based on the nucleotide sequence present in clone 8f. The 
nucleotide sequence of the probe was 

30 s'TCC ATGGCTGTCCGCTTCCACCTCCAAAGT3'. 

The sequence of HCV cDNA in clone 33f, its overtap with that in clone 8f, and the amino acids encoded 
therein are shown in Rg. 20. 

Clone 33g was isolated using a probe based on the sequence of nucleotides in clone 33f. The 
35 nucleotide sequence of the probe was 

5' GCG ACA AtA CGA CAA CAT CCT CTG AGC CCG 3. 

The sequence of HCV cDNA in clone 33g, its overlap with that in clone 33f. and the amino acids encoded 
40 therein are shown in Rg. 21 . 

Clone 7f was isolated using a probe based on the sequence of nucleotides in clone 7e. The nucleotide 
sequence of the probe was 

5 AGC AGA CAA GGG GCC TCC TAG GGT GCA TAA T 3. 

46 

The sequence of HCV cDNA in clone 7f, its overlap with clone 7e, and the amino acids encoded therein are 
shown in Rg. 22. 

Clone lib was isolated using a probe based on the sequence of clotie 7f. The nucleotide sequence of 
the probe was 

5' CAC CTA TGT TtA TAA CCA TCT CAC TCC TCT 3', 

The sequence of HCV cDNA in clone lib, itis overlap with clone 7f, and the amino acids encoded therein 
are shown in Rg. 23. 

55 Clone 141 was isolated using a probe based on the sequence of nucleotides in clone lib. The 
nucleotide sequence of the probe was 

5' CTC TGT CAC CAT ATT ACA AGC GCT ATA TCA 3'. 
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The sequence of HCV cDNA in clone 14i» its overlap with 11b. and the amino acids encoded therein are 
shown in Rg. 24. 

Clone 39c was isolated using a probe based on the sequence of nucleotides in clone 33g. The 
nucleotide sequence of the probe was 

5' CTC GTT GCT ACQ TCA CCA CAA TTT GGT GTA 3 

The sequence of HCV cDNA in clone 39c, Its overlap with clone 33g, and the amino acids encoded therein 
are shown in Rg. 25. ' 



tV.A.18. The Coniposfte HCV cDNA Sequence Derived frbna Isolated Clones Containing HCV cDNA 

The HCV cDNA sequences in the isolated clones described supra have been aligned to create a 
composite HCV cDNA sequence. The Isolated clones, aligned in the 5 to 3 direction are: 14i. 7f, 7e, 8h. 
33c. 40b, 37b, 35, 36, 81, 32, 33b, 25c. 14c, Bf, 33f, 33g. and 39c. 

A composite HCV cDNA sequence derived from the isolated clones, and the amino acids encoded 
therein, is shown in Rg. 26, 

in creating the composite sequence the following sequence, heterogeneities have been considered. 
Clone 33c contains an HCV cDNA of 800 base pairs, which overlaps the cDNAs in clones 40b and 37c. In 
clone 33c, as well as in 5 other overlapping clones, nucleotide #789 is a G. However, in clone 37b (see 
Section IV.A.11), the cooespondtng nucleotide is an A. This sequence difference creates an apparent 
heterogeneity in the amino acids encoded therein, which would be either CYS or TYR, for G or A, 
respectively. This heterogeneity may have important ramifications in terms of protein folding. 

Nucleotide residue #2 in clone Bh HCV cDNA is a T. However, as shown infra, the corresponding 
residue in clone 7e is an A; moreover, an A in this position is also found in 3 other isolated overlapping 
clones. Thus, the T residue in clone 8h may represent a cloning artifact Therefore, in Rg. 26. the residue in 
this position is designated as an A. 

The 3'-termina! nucleotide in clone 8f HCV cDNA is a G, However, the con^esponding residue in clone 
33f, and in 2 other overlapping clones is a T. Therefore, in Rg. 26, tiie residue in this position is designated 
asaT. 

The 3'-terminal sequence in clone 33f HCV cDNA is TTGC. However, the corresponding sequence in 
clone 33g and in 2 other overlapping clones is ATTC. Therefore, in Fig. 26, the corresponding region is 
represented as ATTC. 

Nucleotide residue #4 in clone 33g HCV cDNA is a T. However, in clone 33f and in 2 other overlapping 
clones the corresponding residue is an A. Therefore, in Rg. 26, the corresponding residue is designated as 
an A. 

The 3 -terminus of clone 141 is an AA, whereas the corresponding dinucleotide in clone lib, and in 
three other clones, is TA. Therefore, in Rg. 26, the TA residue is depicted. 
The resolution of other sequence heterogeneities is discussed supra. 

An examination of the composite HCV cDNA indicates that it contains one large ORF. This suggests 
that tine viral genome is translated into a large polypeptide which is processed concomitant with, or 
subsequent to translation. 

IV.A,19. Isolation and Nucleotide Sequences of HCV cDNAs in Clones 12f. 35f, 19g, 26g, and 15e 

The HCV cDNAs in clones 12f, 35f, 19g, 26g, and 15e were isolated essentially by the technique 
described in Section IV.A.17. except tiiat the probes were as indicated below. The frequency of clones 
which hytjridized with the probes was approximately 1 in 50.000 in each case. The nucleotide sequences of 
the HCV cDNAs in tiiese clones were determined essentially as described in Section IV A.2., except that the 
cDNA from the indicated clones were substituted for the cDNA isolated from clone 5-1-1 , 

The isolation of clone 12f, which contains cDNA upstream of the HCV cDNA in Rg. 26, was 
accomplished using a hybridization probe based on the sequence of nucleotides in clone 14i. The 
nucleotide sequence of the probe was 

5' TGC TTG TGG ATG ATG CTA CTC ATA TCC CAA 3'. 
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The HCV cDNA sequence of clone 12f. Its overlap with clone 14i, and the amino acids encoded therein are 
shown in Rg. 27. 

The isolation of clone 35f. which contains cDNA downstream of the HCV. cDNA in Rg. 26, was 
5 accomplished using a hybridization probe based on the sequence of nucleotides in clone 39c. The 
nucleotide sequence of the probe was 

5' AGO AGO GGC GTG AAA AGT GAA GGG TAA CtT 3 . 

70 The sequence of clone 35f, its overlap with the sequence in clone 39c, and the amino acids encoded 
therein are shown in Rg. 28. 

The isolation of clone 19g was accomplished using a hybridization probe based on the 3 sequence of 
clone 35f. The nucleotide sequence of the probe was 

75 5' TTC TCG TAT GAT ACC CGC TGC TTt GAG TCC 3', 

The HCV cDNA sequence of clone I9g. its overiap with the sequence in clone 35f. and the amino acids 
encoded therein are shown in Rg. 29. 

The isolation of clone 26g was accomplished using a hybridization probe based on the 3^ sequence of 
20 clone I9g. The nucleotide sequence of the probe, was 

5' TGT GTG GGG AGG AGT TAG TCG TTA TCT GTG 3. 

The HCV cDNA sequence of clone 26g, its overlap with the sequence in clone 19g. and the amino 
25 acids encoded therein are shown in Rg. 30. 

Clone 15e was isolated using a hybridization probe based on the 3' sequence of clone 26 g. The 
nucleotide sequence of the probe was ^ 

5' GAG ACT CCA GTG AAT TGC TGG CTA GGC AAC 3'. 

30 

The HCV cDNA sequence of clone 15e, its overlap with the sequence in clone 26g. and the amino acids 
encoded therein are shown in Rg. 31. 

the clones described in this Section have been deposited with the ATCC under the terms and 
conditions described in Section II.A., and have beeh assigned the following Accession Numbers, 



Iamt>da-gtl1 


ATCC 


Deposit Date 




No. 




clone 12f 


40514 


10 November 1988 


clone 35f 


40511 


10 November 1988 


clone 15e 


40513 


10 November 1988 


clone k9-1 


40512 


10 November 1988 



The HCV cDNA sequences in the isolated clones described sufsra. have been aligned to create a 
composite HCV cDNA sequence. The isolated clones, aligned in the s' to 3' direction are: 12fi 14i. 7f. 7e, 
8h, 33c. 40b. 37b. 35, 36. 81. 32. 33b. 25c, 14c. 8f 33f, 33g. 39c. 35f, 19g. 26g. and 15e. 

A composite HCV cDNA sequence derived from the isolated clones, and the amino acids encoded 
therein, is shown in Rg. 32. 

60 

iV.A.20. Alternative Method of Isolating cDNA Sequences Upstream of ttje HCV cDNA Sequence in Clone 

Based on the most s' HCV sequence in Rg. 32, which Is derived from the HCV cDNA in clone 12f, 
small synthetic oligonucleotide primers of reverse transcriptase are synthesized and used to bind to the 
corresponding sequence in HCV genomic RNA, to prime reverse transcription of the upstream sequences/ 
The primer sequences are proximal to the known s'-terminal sequence of clone 12f. but sufficiently 
downstream to allow the design of probe sequences upstream of the primer sequences. Known standard 
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methods of priming and cloning are used. The resulting cDNA libraries are screened with sequences 
upstream of the priming sites (as deduced from the elucidated sequence In clone 12f). The HCV genomic 
RNA is obtained from either plasma or iiver samples from chimpanzees with NANBH, or from analogous 
samples from humans with NANBH. 

5 

IV,A.21. Aitemative Method Utilizing Tailing to Isolate Sequences from the 5'*Terminal Region of the HGV 
Genome 

TO In order to isolate the extreme s'-terminai sequences of the HCV RNA genome, the cDNA product of 
the first round of reverse transcription, which is duplexed with the template RNA. is tailed with oligo C. This 
is accomplished by incubating the product with terminal transferase in the presence of CTP. The second 
round of cDNA synthesis, which yields the complement of the first strand of cDNA. is accomplished utilizing 
oiigo G as a primer for the reverse transcriptase reaction. The sources of genomic HCV RNA are as 

IS described in Section IVA^O. The methods for tailing with terminal transferase, and for the reverse 
transcriptase reactions are as in Maniatis et al. (1982). The cDNA products are then cloned, screened, and 
sequenced. 

20 NA.22, Aitemative Method Utilizing Tailing to Isolate Sequences from the s'-Terminal Region of the HCV 
Genome 

This method is based on previously used methods for cloning cDNAs of Ravivirus RNA. In this method, 
the RNA is subjected to denaturing conditions to remove secondary structures at the 3 -terminus, and is 
25 then tailed with Poly A polymerase using rATP as a substrate. Reverse transcription of the poly A tailed 
RNA is catalyzed by reverse transcriptase, utilizing oligo dT as a primer. The second strands of cDNA are 
synthesized, the cDNA products are cloned, screened, and sequenced. 



iV-A^3 Creation of Lambda-gtl l HCV cDNA Ubraries Containing Larger cDNA Inserts 

The method used to create and screen the Lambda gtl 1 libraries are essentially as described in 
Section IV.A.1.» except tiiat the library is generated from a pool of larger size cDNAs eluted from tiie 
Sepharose CL-4B column. 

35 

IV.A.24. Creation of HCV cDNA Libraries Using Synthetic Oligomers as Primers 

New HCV cDNA libraries have been prepared from the RNA derived from the infectious chimpanzee 
40 plasma pool described in Section IV.A.1 ., and from the poly A RNA fraction derived from the liver of this 
infected animal. The cDNA was constructed essentially as described by Gubler and Hoffman (1983), except 
that the primers for tiie first cDNA strand syntiiesis were two synthetic oligomers based on the sequence of 
the HCV genome described supra. Primers based on the sequence of clone 11b and 7e were, respectively. 

45 5 CTG OCT TGA AQA ATC 3' 

and 

5' AGT TAG GCT GGT GAT TAT GC 3'. 

50 The resulting cDNAs were cloned into lambda bacteriophage vectors, and screened with various other 
synthetic oligomers, whose sequences were based oh tiie HCV sequence in Rg. 32^ 



iV.B. Expression of Polypeptides Encoded Wttiiin HCV cDNAs and Identification of tiie Expressed Products 
55 as HCV Induced Antigens. 

IV.B.1. Expression of the Polvoeptide Encoded in Clone 5-1-1. 
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The HCV polypeptide encoded within clone 5-1-1 (see Section IV.A^.. supra) was expressed as a 
fusion polypeptide with superoxide dismutase (SOD). This was accomplished by subcloning the clone 5-1-1 
cDNA insert into the expression vecftor pSODcfl (Steimer et al. (1986)) as follows^ 

Rrst. DNA isolated from pSODcfl was treated with BamHI and EcoRI, and the following linker was 
ligated into the linear DNA created by the restriction enzymes: 

5' GAT CCT GGA ATT CTG ATA A 3' 
3' GA CCT TAA GAC TAT TTt AA 5/ 

After cloning, the plasmid containing the insert was isolated. 

Plasmid containing the insert was restricted with EcoRI. The HCV cDNA insert in clone 5-1-1 was 
excised with EcoRI, and ligated into this EcoRI linearized plasmid DNA. The DNA mixture was used to 
transform E. coli strain D1210 (Sadler et ai. (1980)). Recombinarrts with the 5-1-1 cDNA in the correct 
orientation for expression of the ORF shown in Fig. 1 were identified by restriction mapping and nucleotide 
sequencing. 

Recombinant bacteria from one clone were induced to express the SOD-NANBs-i-i polypeptide by 
growing the bacteria in the presence of IPTG. 



IV.B.2. Expression of the Polypeptide Encoded in Clone 81. 

The HCV cDNA contained within clone 81 was expressed as a SOD-NANBa i fusion polypeptide. The 
method for preparing the vector encoding this fusion polypeptide was analogous to that used for the 
creation of the vector encoding SOD-NANB5-1-1. except that the source of the HCV cDNA was clone 81, 
which was isolated as tJescribed in Section IV,A.3, and for which the cDNA sequence was detenmined as 
described in Section !V.A.4. The nucleotide sequence of the HCV cDNA in clone 81. and the putative amino 
acid sequence of the polypeptide encoded therein are shown in Rg. 4. 

The HCV cDNA insert in clone 81 was excised with EcoRI, and ligated into the pSODcfl which 
contained the linker (see IV.B.1.) and which was linearized by treatment with EcoRI. The DNA mixture was 
used to transform E. coli strain D1210. Recombinants with the clone 81 HCV cDNA in the conrect orientation 
ifor expression of the ORF shown in Rg. 4 were identified by restriction mapping and nucleotide 
sequencing. 

Recombinant bacteria from one clone were induced to express the SOD-NANBsi polypeptide by 
growing the bacteria in the presence of IPTG. 



IV,B.3, Identification of the Polypeptide Encoded Within Clone 5-1-1 a£ an HCV and NANBH Associated 
Antigen. 

The polypeptide encoded within the HCV cDNA of clone 5-1-1 was identified as a NANBH associated 
antigen by demonstrating that sera of chimpanzees and humans infected with NANBH reacted Im- 
munologically with the fusion polypeptide. SOD-NANB5-1-1, which is comprised of superoxide dismutase 
at its N-terminus and the in-frame 5-1-1 antigen at its C-tenminus. This was accomplished by "Western" 
blotting (Towbin et al. (1979)) as follows. 

A recombinant strain of bacteria transformed with an expression vector encoding the SOD-NANB5-1-1 
polypeptide, described in Section fV.B.L. was induced to express the fusion polypeptide by growth in the 
presence of IPTG. Total bacterial lysate was subjected to electrophoresis through polyacrylamide gels iri 
the presence of SDS according to Laemmli (1970). The separated polypeptides were transfenred onto 
riitrocellulose filters (Towbin et al. (1979). The filters were then cut into thin strips, and the strips were 
incubated individually with the different chimpanzee and human sera. Bound antibodies were detected by 
further incubation with ^^{-labeled sheep anti-human !g, as described in Section IV Al. 

The characterization of the chimpanzee sera used for the Western blots and the results, shown in the 
photograph of the autoradiographed strips, are presented in Rg. 33. Nitrocellulose strips containing 
polypeptides were incubated with sera derived from chimpanzees at different times during acute NANBH 
(Hutchinson strain) infections (lanes 1-16), hepatitis A infections (lanes 17-24. and 26-33). and hepatitis B 
infections (lanes 34-44). Lanes 25 and 45 show positive controls in which the immunoblots were incubated 
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with serum from the patient used to identify the recombinant clone 5-1-1 in the original screening of the 
lambda-gtl 1 cDNA library (see Section IV.A.1.). 

The band visiblei in the control lanes. 25 and 45, in Fig. 23 reflects the binding of antibodies to the 
NANB5-1-1 moiety of the SOD fusion polypeptide. These antibodies do not exhibit binding to SOD alone, 

5 since this has also been included as a negative control in these sarnples, and would have appeared as a 
band migrating significantly faster than the SOD-NANB5-1 ~^ fusion polypeptide. 

Lanes 1-16 of Rg. 33 show the binding of antibodies in sera safnples of 4 chimpanzees; the sera were 
obtained just prior to infection with NANBH, and sequentially during acute infection. As seen from the figure, 
whereas antibodies which reacted immunologically with the SOD-NANBs-1-1 polypeptide were absent in 

TO sera samples obtained before administration of infectious HCV inoculum and during the early acute phase . 
of infection, alt 4 animals eventually induced circulating antibodies to this polypeptide during the late part of. 
or following the acute phase. Additional bands observed on the immunoblots in the cases of chimps 
numbers 3 and 4 were due to background binding to host bacte:oteins. 

In contrast to the results obtained with sera from chimps infected with NANBH, the development of 

75 antibodies to the NANBs-t-i moiety of the fusion polypeptide was not observed in 4 chimpanzees infected 
with HAV or 3 chimpanzees infected with HBV, The only binding in these cases was background binding to 
the host bacterial proteins, which also occurred in the HCV infected samples. 

The characterization of the human sera used for the Western blots, and the results, which are shown in 
the photograph of the autoradlographed strips, are presented in Fig. 34. Nitrocellulose strips containing 

20 polypeptides were incubated with sera derived from humans at different times during infections with 
NANBH (lanes 1-21), HAV (lanes 33-40). and HBV (lanes 41-49). Lanes 25 and 60 show positive controls in 
whrch the immunoblots were incubated with serum from patient used in the original screening of the 
lambda-gtl 1 library, described supra. Lanes 22-24 and 26-32 show "non-infected" controls in which the 
sera was from "normal" blood donors. 

25 As seen In Rg. 34. sera from nine NANBH patients, including the semm used for screening the lambda- 
gtl 1 library, contained antibodies to the NANB5-1-1 moiety of the fusion polypeptide. Sera from three 
patients with NANBH did not contain these antibodies. It is possible that the anti-NANBs-i-i antibodies will 
develop at a future date in these patients. It is also possible that this lack of reaction resulted from a 
different NANBV agerrt being causative of the disease in the Individuals from which the non-responding 

30 serum was taken. 

Rg. 34 also shows that sera from many patients infected with HAV and HBV did not contain anti- 
NANB5-1 -1 antibodies, and that these antibodies were also not present in the sera fronri "normal" controls. 
Although one HAV patient (lane 36) appears to contain antl-NANBs-i-t antibodies, it is possible that this 
patient had been previously infected with HCV, since the incidence of NANBH is very high and since it is 
35 often subclinical. 

These serological studies indicate that the cDNA in clone 5-1-1 encodes epitopes which are recognized 
specifically by sera from patients and animals infected with BB-NANBV. In addition, the cDNA does not 
appear to be derived from the primate genome. A hybridization probe made from clone 5-1-1 or from clone 
81 did not hybridize to "Southern" blots of control human and chimpanzee genomic DNA from uninfected 
40 Individuals under conditions where unique, single-copy genes are detectable. These probes also did not 
hybridize to Southern blots of control bovine genomic DNA. 

iV.B.4. Expression of the Polypeptide Encoded in a Composite of the HCV cDNAs in Clones 36, 81 and 32 

45 * 

The HCV polypeptide which is encoded in the ORF which extends through clones 36, 81 and 32 was 
expressed as a fusion polypeptide with SOD. This was accomplished by inserting the composite cDNA, 
ClOO, Into an expression cassette which contains the human superoxide dismutase gene, inserting the 
expression cassette into a yeast expression vector, and expressing the polypeptide in yeast 

50 An expression cassette containing the composite CI 00 cDNA derived from clones 36, 81, and 32, was 
constructed by inserting the -1270bp EcoRI fragment into the EcoRI site of the vector pS3-56 (also called 
pS356), yielding the plasmid pS3-56cioo. The construction of CI 00 is described in Section IVA.16, supra. 

The vector pS3-56, which is a pBR322 derivative, contains an expression cassette which is comprised 
of the ADH2/GAPDH hybrid yeast promoter upstream of the human superoxide dismutase gene, and a 

55 downstream GAPDH transcription terminator. A similar cassette, which contains these control elements and 
the superoxide dismutase gene has been described in Cousens et aL (1987), and in copending application 
EPO 196.056, published October 1, 1986, which is commonly owned by the herein assignee. The cassette 
in pS3-56. however, differs from that in Cousens et al. (1987) in that the heterologous proinsulin gene and 
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the immunoglobulin hinge are deleted, and in that the glni54 of the superoxide dismutase is followed by an 
adaptor sequence which contains an EcoRI site. The sequence of the adaptor is: 

5'-AAT TTG GGA ATT CCA TAA TGA G -3' 

AC CCT TAA GGT ATT ACT CAG CT 

The EcoRI site allows the insertion of heterologous sequences which, when expressed from a vector 
containing the cassette, yield polypeptides which are fused to superoxide dismutase via an oligopeptide 
linker containing the amino acid sequence: 
-asrt-leu-gly-ile-arg-. 
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A sample of pS356 has been deposited on 29 April 1988 under the terms of the Budapest Treaty with 
the American Type Culture Collection (ATCC), 12301 Paridawn Dr., Rockville. Maryland 20853, and has 
been assigned Accession No. 67683. The terms and conditions for availability and access to the deposit, 
and for maintenance of the deposit are the same as those specified in Section HA., for strains containing 
NANBV-cDNAs. This deposit Is intended for convenience only» and is not required to practice the present 
irivention in view of the description here. The deposited material is hereby incorporated herein by reference, 

After recombinants containing the CI 00 cDNA insert in the correct orientation were isolated, the 
expression cassette containing the CI 00 cDNA was excised from pS3-56cioo with BamHI. and a fragment of 
-3400bp which contains the cassette was isolated and purified. This fragment was then Inserted into the 
BamHI site of the yeast vector pAB24. 

Plasmid pAB24, the significant features of which are shown in Rg. 35. is a yeast shuttle vector which 
contains the complete 2 micron sequence for replication [Broach [1981)] and pBR322 sequences. It also 
contains the yeast URA3 gene derived from plasmid YEp24 [Botstein et al. 1979)]. and the yeast LEU^** 
gene derived from plasmid pC1/1. EPO Pub. No. 116,201. Plasmid pAB24 was constructed by digesting 
YEp24 with EcoRI and religating the vector to remove the partial 2 micron sequences. The resulting pfasmid 
YEP24deltaRI. was linearized by digestion with Clal and ligated with the complete 2 micron plasmid which 
had been linearized with Clal. The resulting plasmid, pCBou, was then digested witii Xbai and the 8605 bp 
vector fragment was gel isolated. This isolated Xbal fragment was ligated with a 4460 bp Xbal fragment 
containing the LEU^** gene isolated from pCI/1; the orientation of LEU^ the gene is in the same direction as 
tiie URA3 gene. Insertion of the expression was in the unique BamHI site of tiie pBR322 sequence, thus 
interrupting the gene for bacterial resistance to tetracycline. 

The recombinant plasmid which contained the SOOCIOO expression cassette, pAB24C100-3. was 
transformed into yeast strain JSC 308, as well as into other yeast strains. The cells were transformed as 
described by Hinnen et al. (1978), and plated onto ura-selective plates. Single colonies were inoculated into 
leu-selective media and grown to saturation. The culture was Induced to express the SOD-ClOO polypeptide 
(called CI 00-3) by growth in YEP containing 1% glucose. 

Strain JSC 308 is of the genotype MAT @, Ieu2, ura3(del) DM15 (QAP/ADR1) Integrated at the ADR1 
locus. In JSC 308, over-expression of the positive activator gene product, ADR1 , results in hyperderepres- 
sion (relative to an ADR1 wild type control) and signtficantiy higher yields of expressed heterologous 
proteins when such proteins are syntf>esized via an ADH2 UAS regulatory system. The construction of the 
yeast strain JSC 308 Is disclosed in copending application, U.S. Serial No. (Attomey Docket No. 2300- 
0229), filed concunrently herewith, and which is hereby Incorporated herein by reference. A sample of JSC 
308 has been deposited on 5 May 1988 with the ATCC under the conditions of the Budapest Treaty, and 
has been assigned Accession No. 20879. The terms and conditions for availability and access to the 
deposit, and for maintenance of the deposit are the same as those specified in Section IIA, for strains 
containing HCV cDNAs. 

The complete CI 00-3 fusion polypeptide encoded in pAB24C100-3 should contain 154 amino acids of 
human SOD at the amino-termlrius. 5 amino acid residues derived from the synthetic adaptor containing the 
EcoRI site, 363 amino acid residues derived from CiOO cDNA, and 5 carboxy-terminal amino adds derived 
from tile MS2 nucleotide sequence adjoining the HCV cDNA sequence in clone 32. (See Section IV.A.7.) 
The putative amino acid sequence of the carboxy-terminus of this polypeptide, beginning at the penultimate 
Ala residue of SOD. is shown in Rg. 36; also shown is the nucleotide sequence encoding this portion of the 
polypeptide. 
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IV.B>5. Iderttlficatton of the Polypeptide Encoded within CI 00 as an NANBH Associated Antigen 

The C100-3 fusion polypeptide expressed from plasmid pAB24C100-3 in yeast strain JSC 308 was 
characterized with respect to size, and the polypeptide encoded within CI 00 was identified as an NANBH- 

5 associated antigen by its immunological reactivity with serum from a human with chronic NANBH. 

The CI 00-3 polypeptide, which was expressed as described in Section IV.B.4., was analyzed as follows. 
Yeast JSC 308 cells were transformed with pAB24, or with pAB24Cl00-3, and were Induced to express the 
heterologous plasmid encoded polypeptide. The induced yeast ceils In 1 ml of culture {ODq^o nm -20) were 
pelleted by centrifugation at 10.000 rpm for 1 minute, and were lysed by vortexing them vigorously (10x1 

TO min) with 2 volumes of solution and 1 volume of glass beads (0.2 millimicron diameter). The solution 
contained 50 mM Tris-HCI, pH 8.0. 1 mM EDTA, 1mM phenylmethylsulphonyl fluoride (PMSF), and i 
microgram/ml pepstatin. Insoluble material in the lysate, which includes the CI 00-3 polypeptide, was 
collected by centrifugation (10,000 rpm for 5 minutes), and was dissolved by boiling for 5 minutes in 
Laemmli SDS sample buffer. [See Laemmli (1970)]. An amount of polypeptides equivalent to that in 0.3 ml 

75 of the induced yeast culture was subjected to electrophoresis through 10% polyacrylamide gels in the 
presence of SDS according to Laemmli (1970). Protein standards were co-electrophoresed on the gels. Gels 
containing the expressed polypeptides were either stained with Coomassle brilliant blue, or were subjected 
to "Western" blotting as described in Section IV,B.2m using serum from a patient with chronic NANBH to 
determine the Immunological reactivity of the polypeptides expressed from pAB24 and from pAB24C100-3. 

20 The results are shown in Rg. 37. In Rg. 37A the polypeptides were stained with Coomassie brilliant 
blue. The insoluble poiypeptide(s) from JSC 308 transformed with pAB24 and from two different colonies of 
JSC transformed with pAB24C100-3 are shown in lane 1 (pAB24), and lanes 2 and 3. respectively. A 
comparison of lanes 2 and 3 with lane 1 shows the induced expression of a polypeptide corresponding to a 
molecular weight of -54.000 daltons from JSC 308 transformed with pAB24C100-3, which is not induced in 

25 JSC 308 transformed with pAB24. This polypeptide is indicated by the arrow. 

Rg. 378 shows the results of the Western blots of ^e insoluble polypeptides expressed In JSC 308 
transformed with pAB24 (lane 1). or with pAB24C100-3 (lane 2). The polypeptides expressed from pAB24 
were not immunologically reactivie with'seirum from a human with NANBH. However, as indicated by the 
arrow. JSC 308 transfonmed with pAB24C100-3 expressed a polypeptide of -54.000 dalton molecular 

30 weight which did react immunologically with the human NANBH serum. The other immunologically reactive 
polypeptides in lane 2 may be degradation and/or aggregation products of this -54,000 dalton poIyp)eptide. 



IV.B.6. Purtfication of Fusion Polypeptide C100-3 

35 

The fusion polypeptide. CI 00-3, comprised of SOD at the N-terminus and in-frame C100 HCV- 
polypeptide at the C-terminus was purified by differential extraction of tiie insoluble fraction of the extracted 
host yeast cells in which the polypeptide was expressed. 

The fusion polypeptide, CI 00-3, was expressed in yeast strain JSC 308 transformed with pAB24Cl00-3, 
40 as described in Section IV.B.4. The yeast cells were then lysed by homogenizaOon, the insoluble material in 
^ the lysate was extracted at pH 12.0, and CI 00-3 in the remaining insoluble fraction was solubilized in buffer 
containing SDS. 

The yeast lysate was prepared essentially according to Nagahuma et al. (1984). A yeast cell suspension 
was prepared which was 33% cells (v/v) suspended in a solution (Buffer A) containing 20 mM Tris HCl. pH 

45 8.0, 1 mM dithiothreitol. and 1 mM phenyimethyisulfonylfluoride (PMSF). An aliquot of the suspension (15 
ml) was mixed with an equal volume of glass beads (0.45-0.50 mm diameter), and the mixture was vortexed 
at top speed on a Super Mixer (Lab Line Instruments. Inc.) for 8 min. The homogenate and glass beads 
were separated, and the glass beads were washed 3 times with the same volume of Buffer A as the original 
packed cells. After combining the washes and homogenate, the insoluble material in the lysate was 

50 obtained by cerltrifuging the homogenate at 7,000 x g for 15 minutes at 4*C. resuspending thd pellets in 
Buffer A equal to twice the volume of original packed cells, and re-pelleting the material by ceritrrfugation at 
7.000 X g for 15 min. This washing procedure was repeated 3 times. 

The insoluble material from the lysate was exfracted at pH 12.0 as follows. The pellet was suspended in 
buffer containing 0.5 M NaCI, 1 mM EDTA, where the suspending volume was equal to 1 .8 times the of the 

55 original packed cells. The pH of the suspension was adjusted by adding 0.2 volumes of 0.4 M Na 
phosphate buffer. pH 12.0. After mixing, the suspension was centrifuged at 7.000 x g for 15 min at 4'C, 
and the super natant removed. The extraction was repeated 2 times. The extracted pellets were washed by 
suspending them in 0.5 M NaCI, 1 mM EDTA, using a suspension volume equal to two volumes of the 
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original packed cells, followed by centrifugation at 7.000 x g for 15 min at 4' C, 

The CI 00-3 polypeptide in the extracted pellet was solubilized by treatment with SDS. The pellets were 
suspended in Buffer A equal to 0.9 volumes of the original packed eel! volume, and 0.1 volumes of 2% SDS 
was added. After the suspension was mixed, it was centrifuged at 7,000 x g for 15 min at 4*C. The 
resulting pellet was extracted 3 more times with SDS. The resulting supematants. which contained C1 00-3 
were pooled. 

This procedure purifies C100-3 more than 10-fold from the insoluble fraction of the yeast homogenate, 
and the recovery of the polypeptide is greater than 50%., 

The purified preparation of fusion polypeptide was analyzed by polyacrylamide gel electrophoresis 
according to Laemmli (1970). Based upon this analysis, the polypeptide was greater than 80% pure» and 
had an apparent molecular weight of -54,000 daltons. 



IV-C- Identification of RNA in infected Individuals Which Hybridizi&s to HCV cDNA. 



IV.C.I. Identification of RNA in the Liver of a Chimpanzee With NANBH Which Hybridizes to HCV cDNA. 



RNA from the liver of a chimpanzee which had NANBH was shown to contain a species of RNA which 
hybridized to the HCV cDNA contained within clone 81 by Northem blotting, as follows. 

RNA was isolated from a liver biopsy of the chimpanzee from which the high titer plasma was derived 
(see Section IV.A.1.) using techniques described in Maniatis et al. (1982) for the isolation of total RNA from 
mammalian cells, and for its separation Into poly A* and poly A" fractions. These RNA fractions were 
subjected to electrophoresis on a formaldehyde/agarose gel (1% wA/). and transfen^ed to nitrocellulose. 
(Maniatis et al. (1982)). The nitrocellulose filters were hybridized witii radiolabeled HCV cDNA from clone 81 
(see Rg. 4 for the nucleotide sequence of the insert.) To prepare the radiolabeled probe, tiie HCV cDNA 
insert isolated from clone 81 was radiolabeled witii ^^p ^y nick translation using DNA Polymerase I 
(Maniatis et al. (1982)). Hybridization was for 18 hours at 42* C in a solution containing 10% (w/v) Dextran 
sulphate. 50% (w/v) deionized formamide. 750 mM NaCI, 75 mM Na citrate. 20 mM Na2HP04. pH 6.5. 
0.1% SDS. 0.02% (v/N) bovine serum albumin (BSA). 0.02% (w/v) FicolMOO. 0.02% (w/v) polyvinylpyr- 
rolidone. 100 micrograms/mi salmon spemn DNA which had been sheared by sonication and denatured, and 
10^ CRM/ml of the nick-translated cDNA probe. 

An autoradiograph of ttie probed filter is shown in Rg. 38. Lane 1 contains 32p.|abeled restriction 
fragment mariners. Lanes 2-4 contain chimpanzee liver RNA as follows: lane 2 contains 30 micrograms of 
total RNA; lane 3 contains 30 micrograms of poly A- RNA: and lane 4 contains 20 micrograms of poly A + 
RNA. As shown In Rg. 38. tiie liver of the chimpanzee witii NANBH contains a heterogeneous population of 
related poly A+ RNA molecules which hybridizes to tiie HCV cDNA probe, and which appears to be from 
about 5000 nucleotides to about 11.000 nucleotides in size. This RNA. which hybridizes to tiie HCV cDNA. 
could represent viral genomes and/or specific transcripts of the viral genome. 

The experiment described in Section IV.C.2., infra, is consistent with the suggestion tiiat HCV contains 
an RNA genome. 



'V C.2. Identification of HCV Derived RNA in Semm from Infected Individuals. 

Nucleic acids were extracted from particles isolated from high titer chimpanzee NANBH plasma as 
described in Section IV./V.1.. Aiiquots (equivalent to 1 ml of original plasma) of tiie isolated nucleic acids 
were resuspended in 20 microliters 50 mM Hepes. pH 7.5. 1 mm EDTA and 16 microgramsAnI yeast 
soluble RNA. The samples were denatured by boiling for 5 minutes followed by immediate freezing, and 
were treated witii RNase A (5 microliters containing 0.1 mg/ml RNase A in 25 mM EDTA. 40 mM Hepes. pH 
7.5) or with DNase I (5 microliters containing 1 unit DNase I in 10 mM MgCb. 25 mM Hepes, pH 7.5); 
control samples were incubated witiiout enzyme. Following incubation. 230 microliters of ice-cold 2XSSC 
containing 2 micrograms/ml yeast soluble RNA was added, and tiie samples were filtered on a niti-ocel- 
lulose filter. The filters were hybridized witii a cDNA probe from clone 81 . which had been 32p.|abeled by 
nick-translation. Rg. 39 shows an autoradiograph of tiie filter. Hybridization signals were detected in tiie 
DNase treated and control samples (lanes 2 and 1. respectively), but were not detected in the RNase 
treated sample (lane 3). Thus, since RNase A treatment desti-oyed tiie nucleic acids isolated from tiie 
particles, and DNase I treatment had no effect, tiie evidence strongly suggests tiiat the HCV genome is 
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composed of RNA. 

IV.C,3. Detection of Amplified HCV Nucleic Acid Sequences derived from HCV Nucleic Acid Sequences in 
5 Liver anT PiasmaS'pecimens from Chimpanzees with NANBH 

HCV nucleic acids present in liver and plasma of chimpanzees with NANBH. and in control chim- 
panzees, were amplified using essentially the polymerase chain reaction (PCR) technique described by 
Saiki et aL (1986). The primer oligfonucleotides were derived from the HCV cDNA sequences in clone 81, or 
10 clones 36 and 37. The amplified sequences were detected by gel electrophoresis and Southern blotting, 
using as probes the appropriate cDNA oligomer with a sequence from the region between, but not 
including, the two primers. 

Samples of RNA containing HCV sequences to be examined by the amplification system were isolated 
from liver biopsies of three chimpanzees with NANBH, and from two control chimpanzees. The isolation of 
75 the RNA fraction was by the guanidinium thiocyanate procedure described in Section IV.C.1. 

Samples of RNA which were to be examined by the amplification system were also isolated from tiie 
plasmas of two chimpanzees with NANBH, and from one control chimpanzee, as well as from a pool of 
plasmas from control chimpanzees. One infected chimpanzee had a CID/ml equal to or greater than 10^, 
and the other infected chimpanzee had a CID/ml equal to or greater than 10^. 
20 The nucleic acids were extracted from the plasma as follows. Either 0.1 ml or 0.01 ml of plasma was 
diluted to a final volume of 1.0 ml, with a TENB/proteinase K/SDS solution (0.05 M Tris-HCL, pH 8.0. 0.001 
M EDTA, 0.1 M NaCl. 1 mg/ml Proteinase K, and 0.5% SDS) containing 10 micrograms/ml polyadenylic 
acid, and incubated at 37* C for 60 minutes. After this proteinase K digestion, the resultant plasma fractions 
were deproteinized by extraction with TE (10.0 mM Tris-HCl. pH 8.0, 1 mM EDTA) saturated phenol. The 
25 phenol phase was separated by centrifugation, and was reextracted with TENS containing 0.1% SDS. The 
resulting aqueous phases from each extraction were pooled, and extracted twice with an equal volume of 
phenoi/chloroform/isoamyl alcohol [1:1(99:2)], and then twice with an equal volume of a 99:1 mixture of 
chioroform/isoamyl alcohol. Following phase separation by centrifugation. the aqueous phase was brought 
to a final concentration of 0.2 M Na Acetate, and the nucleic acids were precipitated by the addition of two 
30 volumes of ethanol. The precipitated nucleic acids were recovered by uitracentrifugation in a SW 41 rotor at 
38 K, for 60 minutes at 4* C. 

In addition to the above, the high titer chimpanzee plasma and the pooled control plasma altematively 
were extracted with 50 micrograms of poly A carrier by the procedure of Chomcyzski and Sacchi (1987). 
This procedure uses an acid guanidinium thiocyanate extraction. RNA was recovered by centrifugation at 
35 10.000 RPM for 10 minutes at 4* C in an Eppendorf microfuge. 

On two occasions, prior to the synthesis of cDNA in the PCR reaction, the nucleic acids extracted from 
plasma by the proteinase K/SDS/phenol method were further purified by binding to and elution from 8 and 
S Elutip-R Columns. The procedure followed was according to the manufacturer's directions. 

The cDNA used as a template for the PCR reaction was derived from the nucleic acids (either total 
40 nudeic acids or RNA) prepared as descriljed Bbove. Following ethanol precipitation, the precipitated nucleic 
adds were dried, and resuspended in DEPC treated distilled water. Secondary structures in the nucleic 
acids were disrupted by heating at 65* C for 10 minutes, and the samples were immediately cooled on ice. 
cDNA was synthesized using 1 to 3 micrograms of total chimpanzee RNA from liver, or from nucleic acids 
(or RNA) extracted from 10 to 100 microliters of plasma The synthesis utilized reverse transcriptase, and 
45 was in a 25 microliter reaction, using the protocol specified by the manufacturer, BRL. The primers for 
cDNA synthesis were those also utilized in the PCR reaction, described below. All reaction mixtures for 
cDNA synthesis contained 23 units of the RNAase inhibitor. RNASINTM (Fisher/Promega). Following cDNA 
synthesis, the reaction mixtures were diluted with water, boiled for 10 minutes, and quickly chilled on ice. 
The PCR reactions were performed essentially according to the manufacturer's directions (Cetus- 
50 Perkin-Elmer), except for tiie addition of 1 microgram of RNase A. The reactions were canied out in a final 
volume of 100 microliters. The PCR was performed for 35 cycles, utilizing a regimen of 37* C, 72* C, and 
94*C. 

The primers for cDNA syrthesis and for tiie PCR reactions were derived from the HCV cDNA 
sequences in either clone 81. clone 36, or clone 37b. (The HCV cDNA sequences of clones 81, 36, and 37b 
65 are shown in Rgs. 4. 5, and 10, respectively.) The sequences of the two 16-mer primers derived from clone 
81 were: 

5' CAA TCA TAC CTG ACA G 3 
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and 

5' GAT AAC CTC TGC CTG A 3 . 

The sequence of the primer from clone 36 was: 

5 

5 GCA TGT CAT GAT GTA T 3'. 

The sequence of the primer from clone 37b was: 

70 5 ACA ATA CGT GTG TCA C 3'. 

In the PGR reactions, the primer pairs consisted of either this two 16-mers derived from clone 81, or the 16- 
mer from clone 36 and the 16-mer from clone 37b. 

The PGR reaction products were analyzed by separation of the products by alkaline gel electrophoresis. 

75 followed by Southem blotting, and detection of the amplified HOV-cDNA sequences with a ^sp-iabeled 
internal oligonucleotide probe derived from a region of the HCV cDNA which does not overlap the primers. 
The PGR reaction mixtures were extracted with phenol/chloroform, and the nucleic acids precipitated from 
the aqueous phase with salt and ethanoL The precipitated nucleic acids were collected by centrifugation, 
and dissolved in distilled water. Aiiquots of the samples were subjected to electrophoresis on 1,8% alkaline 

20 agarose gels. Single stranded DNA of 60. 108. and 161 nucleotide lengths were co-electrophoresed on the 
gels as molecular weight markers. After electrophoresis, the DNAs in the gel were transferred onto Biorad 
Zeta ProbeTM paper. Prehybridizatiori and hybridization, and wash conditions were those specified by the 
manufacturer (Biorad). 

The probes used for the hybridization-detection of amplified HCV cDNA sequences were the following. 

25 When the pair of PGR primers were derived from clone 81. the probe was an 108-mer with a sequence 
corresponding to that which is located in the region between the sequences of the two primers. When the 
pair of PGR primers were derived from clones 36 and 37b, the probe was the nick-translated HCV cDNA 
insert derived from clone 35. The primers are derived from nucleotides 155-170 of the clone 37b insert, and 
206-268 of the clone 36 Insert. The 3'-end of the HCV cDNA insert in clone 35 overlaps nucleotides 1-186 

30 of the insert in clone 36; and the 5'-end of clone 35 insert overiaps nucleotides 207-269 of the insert in 
clone 37b. (Compare Rgs. 5. 8 and 10.) Thus, the cDNA insert in clone 35 spans part of the region between 
the sequences of the clone 36 and 37b derived primers, and is useful as a probe for the amplified 
sequences which include these primers. 

Analysis of the RNA from the liver specimens was according to the above procedure utilizing both sets 

35 of primers and probes. The RNA from the liver of the three chimpanzees with NANBH yielded positive 
hybridization results for amplification sequences of the expected size (161 and 588 nucleotides for 81 and 
36 and 37b. respectively), while the control chimpanzees yielded negative hybridization results. The same 
results were achieved when the experiment was repeated three times. 

Analysis of the nucleic acids and RNA from plasma was also according to the above procedure utiiizing 

40 the primers and probe from clone 81 . The plasmas were from two chimpanzees with NANBH. from a control 
chimpanzee, and pooled plasmas from control chimpanzees. Both of the NANBH plasmas contained nucleic 
acids/RNA which yielded positive results in the PGR amplified assay, while both of the control plasmas 

^ yielded negative results. These results have been repeatably obtained several times. 

45 

'V'P^ Radioimmunoassay for Detecting HCV Antibodies in Serum from Infected Individuals 

Solid phase radioimmunoassays to detect antibodies to HCV antigens were developed based upon Tsu 
and Herzenberg (1980). Microtiter plates (Immuion 2, Removawell strips) are coated with purified polypep- 

50 tides containing HCV epitopes. The coated plates are incubated witii either human semm samples 
suspected of containing antibodies to the HCV epitopes, or to appropriate controls. During incubation, 
antibody, if present, is immunologically bound to the solid phase antigen. After removal of the unbound 
material and washing of the microtiter plates, complexes of human antibody-NANBV antigen are detected 
by incubation with ^25|,|abeled sheep anti-human immunoglobulin. Unbound labeled antibody is removed by 

55 aspiration, and tiie plates are washed. The radioactivity in individual wells is determined; the amount of 
bound human anti-HCV antibody is proportional to the radioactivity in tiie well. 
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IV.D.1. Purificaaon of Fusion Polypeptide SOD-NANBs^i-i 

The fusion polypeptide SOD-NANBs-i-i, expressed in recombinant bacteria as described in Section 
IV.B.1 was purified from the recombinant B coH by differential extraction of the cell extracts with urea, 

5 followed by chromatography on anion and cation exchange columns as follows. 

Thawed cells from 1 liter of culture were resuspended in 10 ml of 20% (w/v) sucrose containing 0.01 M 
Tris HCI. pH 8.0, and 0.4 ml of 0.5M EDTA, pH 8.0 was added. After 5 minutes at O'C, the mixture was 
centrifuged at 4,000 x g for 10 minutes. The resulting pellet was suspended in 10 ml of 25% (w/v) sucrose 
containing 0.05 M Tris HCI, pH 8.0, 1 mM phenylmethylsulfonytfluoride (PMSF) and 1 microgram/ml 

10 pepstatin A. followed by addition of 0.5 ml lysozyme (10 mg/ml) and incubation at O' C for 10 minutes. After 
the addition of 10 ml 1% (v/v) Triton X-100 in 0.05 M Tris HCI, pH 8.0, 1 mM EDTA, tfie mixture was 
incubated an additional 10 min at 0*C with occasional shaking. The resulting viscous solution was 
homogenized by passage 6 times through a sterile 20-gauge hypodermic needle, and centrifuged at 13,000 
x g for 25 minutes. The pelleted material was suspended in 5 ml of 0.01 M Tris HCI pH 8.0, and the 

15 suspension centrifuged at 4.000 x g for 10 minutes. The pellet, which contained SOD-NANB5-1-1 fusion 
protein, was dissolved in 5 ml of 6 M urea in 0.02 M Tris HCI, pH 8.0, 1 mM dithiothreitol (Buffer A), and 
was applied to a column of Q-Sepharose Fast Row equilibrated with Buffer A. Polypeptides were eluted 
with a linear gradient of 0.0 to 0.3 M NaCI in Buffer A, After elution, fractions were analyzed by 
polyacrylamide gel electrophoresis in the presence of SDS to determine their content of SOD-NANB5-1-1. 

20 Fractions containing this polypeptide were pooled, and dialyzed against 6 M urea in 0.02 M sodium 
phosphate buffer, pH 6.0^ 1 mM dithiothretto! (Buffer B). The dialyzed sample was applied on a column of 
S-Sepharose Fast Row equilibrated with Buffer B. and polypeptides eluted with a linear gradient of 0.0 to 
0.3 M NaCl in Buffer B. The fractions were analyzed by polyacrylamide gel electrophoresis for the presence 
of SOD-NANB5-1-1. and the appropriate fractions were pooled. 

25 The final preparation of SOD-NANB5-1-1 polypeptide was examined by electrophoresis on 
polyacrylamide gels in the presence of SDS. Based upon this analysis, the preparation was more than 80% 
pure. 



IV-P'2. Purification of Fusion Polypeptide SGD-NANBsi. 

The fusion polypeptide SOD-NANBsi expressed in recombinant bacteria as described in Section 
iV.B.2., was purified from recombinant E. coH by differential extraction of the eel! extracts with urea, followed 
by chromatography on anion and cation exchange columns utilizing the procedure described for the 
35 isolation of fusion polypeptide SOD-NANBs-i-i (See Section IV.D.1.). 

The final preparation of SOD-^4ANB8 1 polypeptide was examined by electrophoresis on polyacrylamide 
gels in the presence of SDS. Based upon this analysis, the preparation was more tiian 50% pure. 



Detection of Antibodies to HCV Epitopes by Solid Phase Radioimmunoassay. 

Serum samples from 32 patients who were diagnosed as having NANBH were analyzed by radioim- 
munoassay (RIA) to determine whetiier antibodies to HCV epitopes present in fusion polypeptides SOD- 
NAN Bs-i-i and SOD-NANBbi were detected. 

45 f^icrotiter plates were coated with SOD-NANBs-t-i or SOD-NANBsi, which had been partially purified 
according to Sections IV.D.1 . and IV.D.2., respectively. The assays were conducted as follows. 

One hundred microliter aliquots containing 0.1 to 0.5 micrograms of SOD-NANB5-1-1 or SOD-NANBsi 
in 0.125 M Na borate buffer, pH 8.3, 0.075 M NaCl (BBS) was added to each well of a microtiter plate 
(Dynatech Immulon 2 Removawell Strips). The plate was incubated at 4* C overnight in a humid chamber, 

50 after which, tiie protein solution was removed and the wells washed 3 times with BBS containing 0.02% 
Triton X-100 (BBST). To prevent non-specific binding, the wells were coated with bovine serum albumin 
(BSA) by addition of 100 microliters of a 5 mg/ml solution of BSA in BBS followed by incubation at room 
temperature for 1 hour; after this incubation the BSA solution was removed. The polypeptides in tiie coated 
wells were reacted with serum by adding 100 microliters of serum samples diluted 1:100 in 0.01 M Na 

55 phosphate buffer, pH 7.2. 0.15 M NaCl (PBS) containing 10 mg/ml BSA, and incubating the serum 
containing wells for 1 hr at 37* C. After incubation, the serum samples were removed by aspiration, and the 
wells were washed 5 times with BBST. Ahti-NANBs-i-i and Anti-NANBsi bound to the fusion polypeptides 
was detenmined by the binding of labeled f' (ab)2 sheep anti-human IgG to the coated welis. ARquots of 
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100 microliters of the labeled probe (specific activity 5-20 microcuries/microgram) were added to each well, 
and the plates were incubated at 37° C for 1 hour, followed by removal of excess probe by aspiration, and 5 
washes with BBST. The amount of radioactivity bound in eaich well was deternr>lned by counting in a 
counter which detects gamma radiation. 

The results of the detection of antl-NANBs^i-i and ariti-NANBsi in individuals with NANBH is 
presented in Table 1 . 



Table I 

Detection of Anti^5*l-1 and Anti*81 in Sera of 
HAKB, HAV and KBV Hepatitis Patients 

l^aLient 

Reference S/N 



Number 


Diaonosii 


Anti-5-l-l 


Anti-81 


I. 


28^ 


Chronic NANB. ivo' 
Chronic NANB, Zvd 
Chronic NANl, XVD 


0.77 
1. 14 
2.11 


4 20 
5.14 
4 .05 


2. 


29^ 


AVH^, NANB, Sporadic 
Chronic, NANB 
Chronic, NANB 


1.09 
33.09 
36.22 


1 .05 
11.39 
13 .67 


3. 




AVH, NANB, XVD 
Chronic NANB, ZVO 
Chronic NANB, ivd 


1-90 
34.17 
32.45 


1 .54 
30.28 
30.84 


4 . 


31 


Chronic NANB, PT* 


16.09 


8.05 


5. 


32^ 


Lata AVH NANB, IVD 
Late AVH NANB, ZVO 


0.69 
0.73 


0.94 
0.68 


6. 




AVH, NANB, ZVD 
AVH, NANB, ZVD 


1.66 
1.S3 


1.96 
0.56 


7. 


34^ 


Chronic NANB, PT 
Chronic NANB, Pt 
Chronic NANB, PT 
Chronic NANB, Pf 


34.40 
45.55 
41.58 
44.20 


7.55 
13.11 
13.45 
15.48 


8. 


35^ 


AVH NANB, ZVD 
'H«al*d* r*c«nt 

NANB, AVH 


31.92 
6.87 


31.95 
4.45 


9. 


36 


Lat« AVH MAMB PT 


11.84 


5.79 


10. 


37 


AVH NANB, ZVD 


6.52 


1.33 


11. 


38 


X,at« AVH HANB, Pt 


39.44 


39,18 


)2. 


39 


Chronic MANB, PT 


42.22 


37.54 


13. 


40 


AVH, MAMB, PT 


1.3S 


1.17 


14. 


41 


Chronic MAMB7 PT 


0.3S 


0.28 
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Patient 

Reference S/N 



Number 


Diagnosis 


Anti-5*l»l 


Anti-Bl 


15. 


42 


AVH t NANo r I VU 


6.25 


2.34 


16. 


4 3 


wnronic iwAfior rx 


U • 74 


0 « 6 I 


17. 


44 


Aim HAiiB D*r 




1.83 


1 Q . 


45 


Chronic, HAKB, PT 


0.52 


0.32 


19. 


46 


AVH, NANB 


23.35 


4.15 


20. 


47 


A*nr lyp* A 




1 . 35 


21. 


48 


AVH, Type A 


1.30 


0.66 


22 . 


49 


AVH, Type A 


1.44 


0.74 


23. 


50 


Resolved Recent AVHr 
Type A 


0.48 


0.56 


24. 


51 


AVH, Tvoe A 
Resolved AVH, Typ« A 


0.68 
0.80 


w • U ^ 

0.65 


29 . 


• 


R«solv*d R*c«nt AVH, 

lype A 
Resolved Recent AVK, 
Type A 


1.38 
0.80 


1.04 
0.65 


26. 


53 


<^*n., 1 jrpe A 
Resolved Recent AVH^ 
Type A 


1.02 


1 . lo 
0.88 


27 . 


54 


AVH, Type A 


1.35 


0.74 


20. 


55 


Late AVH, KBV 


0.58 


O.SS 


29. 


56 


Chronic HBV 


0.84 


1.06 


30. 


57 


Late AVH, KBV 


3.20 


1.60 


31. 


58 


Chronic HBV 


0.47 


0.46 


32. 


59^ 


AVH, HBV 
Healed aVh, ttBV 


0.73 
0.43 


0.60 
0.44 


33. 


60^ 


AVH, HBV 
Healed AVH, KBV 


1.06 
0.75 


0.92 
0.68 
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Pacient 
Reference 
Number * 


Oiaonosis 




S/N 

. Anti-5-l-i 


A.nti-81 


s 


34 . 


6l' 


AVH, HBV 
Healed AVH^ 


HBV 


1.66 
0.63 


0.61 
0.36 






62^ 


AVH. HBV 
Healed AVH, 


HBV 


1.02 
0.41 


0.73 
0.42 


10 


ir>. 


63^ 


AVH, HBV 
HeaieQ avh. 


hdv 


1.24 

JL . 9 9 


1.31 




J7 . 


64» 


AVH, HBV 

Healed AVH, 


HBV 


0.82 
0.53 


0.79 
0.37 


15 


TO. 


65.1 


AVH, HBV 
Healed AVH, 


HBV 


0.95 
0.70 


0.92 
0.50 




J9. 


66^ 


AVH, HBV 
Healed AVH, 


HBV 


1.03 
1.71 


0.68 
1.39 



20 



Sequential serum samples available from these patients 
^Tvo-Intravenus Drug User 
3AVH«Acute viral hepatitis 
^PT«Post transfusion 

' 25 

As seen in Table 1, 19 of 32 sera from patients diagnosed as having NANBH were positive with respect 
to antibodies directed against HCV epitopes present in SOD-tslANBs-i-i and SOD-NANBsi. 

However, the serum samples which were positive were not equally immunologically reactive with SOO 

30 NANBs-i-i and SOD-NANBsi. Serum samples from patient No. 1 were positive to SOD-NANBsi but not to 
SOD-NANBs-i-u Serum samples from patients number 10. 15, and 17 were positive to SOD-NANB5-1-1 
but not to SOD-NANBsi. Serum samples from patients No. 3. 8. 11, and 12 reacted equally with both fusion 
polypeptides, whereas serum samples from patients No. 2. 4, 7. and 9 were 2-3 fold higher in the reaction 
to SOD-NANB5-1.1 than to SOD-NANBgi. These resutts suggest that NANBs-i-t and NANBsi may 

35 contain at least 3 different epitopes; i.e.. it is possible tiiat each polypeptide contains at least 1 unique 
epitope, and that the two polypeptides share at least 1 epitope. 



IV.D.4. Specificity of the Solid Phase RiA for NANBH 

40 

The specificrty of the solid phase RIAs for NANBH was tested by using the assay on serum from 

patients infected with HAV or with HBV and on sera from control Individuals. The assays utilizing partially 
^ purified SOD-NANB5-1-1 and SOD-NANBsi were conducted essentially as described in Section IV.D.3, 

except tiiat the sera was from patients previously diagnosed as having HAV or HBV, or from individuals who 
45 were blood bank donors. The results for sera from HAV and HBV infected patients are presented in table 1, 

The RIA was tested using 1 1 serum specimens from HAV infected patients, and 20 seaim specimens from 

HBV infected patients. As shown in table 1 , none of tiiese sera yielded a positive immunological reaction 

with the fusion polypeptides containing BB-NANBV epitopes. 

The RIA using the NANB5-1-1 antigen was usedlo determine Immunological reactivity of serum from 
50 control individuals. Out of 230 serum samples oiDtained from the normal blood donor population, only 2 

yielded positive reactions in tfie RIA (data not shown). It Is possible that tiie two blood donors from whom 

tiiese serum samples originated had previously been exposed to HCV. 

55 IV.D.5. Reactivity of NANBs-i-i During the Course of NANBH Infection. 

The presence of anti-NANBs-t-i antibodies during tiie course of NANBH infection of 2 patients and 4 
chimpanzees was followed using RIA as described in Section IV.D.3. In addition the RIA was used to 
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75 



20 



determine the presence or absence of anti-NANBs-i-i antibodies during the course of infection of HAV 
and HBV in infected chimpanzees. 

The results, which are presented In tat>le 2. show that with chimpanzees and with humans, anti- 
NANB5-1-1 antibodies were detected following the onset of the acute phase of NANBH infection. Anti- 
5 NANB5-1-1 antibodies were not detected in serum samples from chimpanzees infected with either iHAV or 
HBV, Thus ant-NANBs-i-i antibodies serve as a marker for an individual's exposure to HCV. 

T*bl* 2 

Seroconversion in 5equ*nti«i Seru» S«»pl«» tro« 
10 Hepatitif P«tx«nt« «nd Chi»p«nt»«i Usi^ng S-l-l Anttgen 

t»^t)»nt/ SampU 0«t« (0«yt) Hepatitis 
Chimo f o«Lnoeuiation dsv) V iruses 

Patient 29 Tl »AHB 

•T*20S 

Patient 30 T HAWS 

T*307 

Chimp I 0 WAHB 

76 
119 
154 

Chintp 2 0 HXHB 

21 
73 
138 

Chimp 3 0 HAHB 

43 
53 
L59 

30 ChUnp 4 -3 NAIIB 

93 
140 

Chimp 5 Q HAV 

25 

35 ^0 

2<8 

Chimp 6 -9 HAV 

15 
41 

40 



45 



25 



50 



Anti-5»l-l 


ALT 

(mu/ml ' 


1.09 


1180. 




^ • J 


36.22 




1.90 


1830 


34.17 


290 


J « • ^ ? 


2*6 


0-87 


0 


0-93 


?'l 


23.67 


19 


32.41 




1. 00 


5 


1.08 


52 


4.64 


13 


25.01 




1.08 


8 


1.44 


205 


1.92 


14 


11.87 


6 


1.12 


11 


1.25 


132 


6.60 




17.51 




1.50 


4 


2.39 


147 


1.92 


18 


1.53 


5 


0.8S 






106 


0.81 


10 


1.33 
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70 



75 







H«p«riti« 


AAti-5-l-l 


ALT 


Cliimi? 


r o«inoeulAtion d«v> 




fS/Nl 


fmu/mli 


Chiwp 7 


0 


RAV 


1.17 


7 




22 




1.60 


83 








1.55 


5 




139 




l.€0 




Chimp 9 


0 


HAV 


0.77 


15 


26 






130 








1.77 


8 




205 




1.27 


S 


Chimp 9 


•250 


KBV 


1.74 








3.29 


9 




435 




2.77 


6 


ChiPio 10 


0 


HBV 


2.35 


8 




llL-llS (pool) 




2.74 


95-156 (pcc 




20S 




2.05 


9 




240 




1.78 


13 


Chimp 11 


0 


KBV 


1.82 


11 




28-56 (pooX) 




1.26 


8-100 (poo 




169 






9 




223 




'o.52 


10 



»T«d«y o£ initial* ««mplinq 



fV.E. Purification of Polyclonal Serum Antibodies to NANBs-i-i 

On the basis of the specific immunoiogical reactivity of tiie SOD-NANBs-i-i polypeptide with tiie 

30 antibodies in serum samples from patients with NANBH. a metiiod was developed to purify serum 
antibodies which react immunologically with tiie epitope(s) in NANB5-1-1. This method utilizes affinity ' 
chromatography. Purified SOD-NAN Bs- 1-1 polypeptide (see Section IV.D.1) was attached to an insoluble 
support; the attachment is such that the immobilized polypeptide retains its affinity for antibody to 
NANB5-1-1. Antibody in serum samples is absorbed to the matrix-bound polypeptide. After washing to 

35 remove noh-spedfically bound materials and unbound materials, the bound antibody is released from the 
bound SOD-HCV polypeptide by change in pH, and/or by chaotropic reagents, for example, urea 

Nitrocellulose membranes containing bound SOD-NANB5-1-1 were prepared as follows. A nitrocel- 
lulose membrane, 2.1 cm Sartorius of 0.2 micron pore size, was washed for 3 minutes three times with 
BBS. SOD-NANB5-1-1 was bound to tiie membrane by Incubation pf the purified preparation in BBS at 

40 room temperature for 2 hours; alternatively it was incubated at 4'C overnight. The solution containing 
unbound antigen was removed, and tiie filter was washed three times witii BBS for three minutes per wash. 
The remaining active sites on tiie membrane were blocked witii BSA by incubation with a 5 mg/ml BSA 
solution for 30 minutes. Excess BSA was removed by washing the membrane with 5 times with BBS and 3 
times with distilled water. The membrane containing the viral antigen and BSA was then treated with 0.05 M 

45 g*yc*n© hydrochloride, pH 2.5. 0.10 M NaCI (GlyHCI) for 15 minutes, followed by 3 three minute washes 
with PBS. 

Polyclonal anti-NANBs-i -1 antibodies were isolated by incubating \he membranes containing the fusion 
polypeptide witii serum from an Individual with NANBH for 2 hours. After tiie incubation, tiie filters were 
washed 5 times with BBS, and twice with distilled water. Bound antibodies were tiien eluted from each fitter 
50 with 5 elutions of GlyHa. at 3 minutes per elution. The pH of tiie eluates was adjusted to pH 8.0 by 
collecting each eluate in a test tube containing 2.0 M Tris HCl, pH 8.0. Recovery of the anti-NANBs-i-i 
antibody after affinity chromatography is approximately 50%. 

The nitrocellulose rhembranes containing the bound viral antigen can be used several times without 
appreciable decrease in binding capacity. To reuse tine membranes, after the antibodies have been eluted 
55 the membranes are washed with BBS tiiree times for 3 minutes. Tliey are tiien stored in BBS at 4* C. 



ThQ Capture of HCV Particles from Infected Plasma Using Purified Human Polyclonal Anti-HCV 



49 



1F> 0 318 216 A1 

Antibodies; Hybridization of the Nucleic Acid in tlie Captured Particies to HCV cDNA 



IV.F.1 . The Capture of HCV Particles from infected Plasma Using Human Poiycionai Anti-HCV Antibodies 

Protein-nucleic acid complexes present in infectious plasma of a chimpanzee with NANBH were 
isolated using purified human polyclonal anti-HCV antibodies which were bound to polystyrene beads. 

Polyclonal anti-NANBs-i-i antibodies were purified from serum from a human with NANBH using the 
SOD-HCV polypeptide encoded in clone 5-1-1. The method for purification was that described in Section 
!V.E. 

The purified anti-NANBs-i-i antibodies were bound to polystyrene beads (1/4" diameter, specular fin 
ish. Precision Plastic Bail Co., Chicago, Illinois) by incubating each at room temperature ovemight with 1 ml 
of antibodies (1 microgram/m! in borate buffered saline. pH 8.5). Following the overnight incubation, the 
beads were washed once with TBST [50 mM Tris HCI. pH 8.0, 150 mM NaCi. 0.05% (v/v) Tween 20], and 
then with phosphate buffered saline (PBS) containing 10 mg/ml BSA. 

Control beads were prepared in an Identical fashion, except that the purified anti-NANBs-i-i antibodies 
were replaced with total human immunoglobulin. 

Capture of HCV from NANBH infected chimpanzee plasma using the anti-NANBs-i-i antibodies bound 
to beads was accomplished as follows. The plasma from a chimpanzee with NANBH used is described in 
Section IVJV.1.. An aliquot (1 ml) of the NANBV infected chimpanzee plasma was incubated for 3 hours at 
37 *C with each of 5 beads coated with either anti-NANBs-i-i antibodies, or with control immunoglobulins. 
The beads were washed 3 times with TBST. 



IV.F.2. Hybridization of the Nucleic Acid in the Captured Particles to NANBV-cDNA 

The nucleic acid component released from the particles captured with anti-NANBs-t-i antibodies was 
analyzed tor hybridization to HCV cDNA derived from clone 81 . 

HCV particles were captured from NANBH infected chimpanzee plasma, as described in IV.F.l. To 
release tiie nucieic acids from the particles, tiie washed beads were incubated for 60 min. at 37* C witii 0.2 
ml per bead of a solution containing proteinase k (1 mg/ml), 10 mM Tris HCI. pH 7.5. 10 rriM EDTA, 0.25% 
(w/v) SDS. 10 micrograms/ml soluble yeast RNA, and tiie supernatant solution was removed. The 
supernatant was extracted with phenol and chloroform, and the nucleic acids precipitated with ethanol 
ovennight at -20* C. The nucleic add precipitate was collected by centrtfugation, dried, and dissolved in 50 
mM Hepes. pH 7.5. Duplicate aliquots of the soluble nucleic acids from the samples obtained from beads 
coated with anti-NANBs-i-i antibodies and witii control beads containing total human immunoglobulin were 
filtered onto to nitrocelluiose filters. The filters were hybridized wrtii a ^^p-iabeled. nick-transtated probe 
made from the purified HCV cDNA fragment In clone 81. The methods for preparing tiie probe and for tiie 
hybridization are described in Section IV.C.1.. 

Autoradiographs of a probed filter containing the nucleic acids from particles captured by beads 
containing anti-NANBs-t-i antilx>dies are shown in Rg, 40. The extract obtained using tiie anti-NANBs-i-i 
antibody (A1.A2) gave clear hybridization signals relative to tiie control antibody extract (A3.A4) and to 
control yeast RNA (B1.B2). Standards consisting of Ipg, 5pg. and lOpg of tiie purified, clone 81 cDNA 
fragment are shown in C1-3, respectively. 

These results demonstrate tiiat ttie particles captured from NANBH plasma by anti-NANBs-1-1- 
antibodies contain nucleic acids which hybridize with HCV cDNA in clone 81, and thus provide further 
evidence that tiie cDNAs in these clones are derived from the etiologic agent for NANBH. 

ly.G, Immunological Reactivity of CI 00-3 with Purified Anti-NANBs-i-i Antibodies 

The immunological reactivity of CI 00-3 fusion polypeptide witii anti-NANBs-t-i antibodies was deter- 
mined by a radioimmunoassay, in which the antigens which were bound to a solid phase were challenged 
witii purified anti-NANBs-i-i antibodies, and the antigen-antibody complex detected with ^^s labeled sheep 
anti-human antibodies. The immunological reactivity of C100-3 polypeptide was compared witii tiiat of SOD- 
NANB5-1-1 antigen. 

The fusion polypeptide CI 00-3 was synthesized and purified as described in Section IV.B.5. and in 
Section IV.B.6., respectively. The fusion polypeptide SOD-NANBs-i-t was synthesized and purified as 
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described in Section IV.B.1. and in Section IV.D.1., respectively- Purified anti-NANBs-i-i antibodies were 
obtained as described in Section IV.E , 

One hundred nnicroiiter aliquots containing varying amounts of purified CI 00-3 antigen in 0.1 25M Na 
borate buffer, pH 8.3. 0.075M NaCf (BBS) was added to each well of a nriicrotiter plate (Dynatech InnmUlon 2 

5 Removawell Strips). The plate was incubated at 4* C overnight in a humid chamber, after which, the protein 
solution was removed and the wells washed 3 . times with BBS containing 0.02% Triton X-100 (BBST). To 
prevent non-specific binding, the wells were coated with BSA by addition of 100 microliters of a 5 mg/ml 
solution .of BSA in BBS followed by tncul:>ation at room temperature for 1 hour, after which the excess BSA 
solution was removed. The polypeptides in the coated wells were reacted with purified anti-NANBs-i-v 

70 antibodies by adding 1 microgram antibody/well, and incubating the samples for 1 hr at 37* C. After 
incubation, the excess solution was removed by aspiration, and the wells were washed 5 times with BEST- 
Anti-IMANBs-i-i bound to the fusion polypeptides was detemnined by the binding of ^^Habeled F'(ab)2 
sheep anti-human igG to the coaled wells. Aliquots of 100 microliters of the labeled probe (specific activity 
5-20 microcuries/microgram) were added to each well, and the plates were incubated at 37' C for 1 hour. 

75 followed by removal of excess probe by aspiration, and 5 washes with BBST. The amount of radioactivity 
bound in each well was determined by counting in a counter which detects gamma radiation. 

The results of the immunological reactivity of CI 00 with purified anti-NANBs-i-i as compared to that 
of NANBg-i-t with the purified antibodies are shown in Table 3, 

20 Table 3 



Immunological Reactivity of CI 00-3 compared to MANB5.1.1 by 
Radioimmunoassay 


AG(ng) 


RIA (cpm/assay) 


400 


320 


240 


160 


60 


0 


NANBs.1.1 
CI 00-3 


7332 
7450 


6732 
6985 


4954 
5920 


4050 
5593 


3051 
4096 


57 
67 



The results in Table 3 show that anti-NANBs-i-i recognizes an epitope{s) in the ClOO moiety of the 
CI 00-3 polypeptide. Thus NANB5-1-1 and C100 share a commori epitope(s). The results suggest that the 
cDNA sequence encoding this NANBV epitope{s) is one which Is present in both clone 5-1-1 and in clone 
81. 

l^\A.H. Characterizatioh Of HCV 



IV.H.1. Characterization of the Strandedness of the HCV Genome. 

The HCV genome was characterized with respect to its strandedness by isolating the nucleic acid 
fraction from particles captured on anti-NANBs-i-i antibody coated polystyrene beads, and determining 
whether the isolated nucleic acid hybridized with plus and/or minus strands of HCV cDNA. 

Particles were captured from HCV infected chimpanzee plasma using polystyrene beads coated with 
immunopurlfled anti-NANBs-i-i antibody as described in Section IV.F.1. The nucleic acid component of 
the particles was released using the method described in Section IV.F.2. Aliquots of the isolated genomic 
nucleic acid equivalent to 3 mis of high titer plasma were blotted onto nitrocellulose filters. As controls, 
aliquots of denatured HCV cDNA from clone 81 (2 picograms) was also blotted onto tiie same filters. The 
filters were probed witii ^P-labe!ed mixture of plus or mixture of minus strands of single stranded DNA 
cloned from HCV cDNAs: the cDNAs were excised from clones 40b, 81, and 25c. 

The single stranded probes were obtained by excising the HCV cDNAs from clones 81, 40b. and 25c 
witii EcoRi. and cloriing the cDNA fragments in M13 vectors, mp18 and mp19 [Messing (1983)]. The M13 
clones were sequenced to determine whether they contained the plus or minus strands of^DNA derived 
from ttie HCV cDNAs. Sequencing was by tiie dideoxychain termination metinod of Sanger et al. (1977). 

Each of a set of duplicate filters containing aliquots of tine HCV genome isolated from the captured 
particles was hybridized witii either plus or minus strand probes derived from tiie HCV cDNAs. Rg. 41 
shows the autoradiographs obtained from probing tiie NANBV genome with the nriixtuVe of probes derived 
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from clones 81, 40b. and 25c. This mixture was used to increase the sensitivity of the hybridization assay. 
The samples in panel I were hybridized with the plus strand probe mixture. The samples in pane! II were 
probed by hybridization with the minus strand probe mixture. The composition of the samples in the panels 
of the immunoblot are presented in table 4. * 

5 ' ... 

Table 4 



lana 


A 


B 


1 


HCV genome 


* 


2 






3 


« 


cDNA 81 


4 




cDNA 81 



* is an undescribed sample. 



As seen from the results in Rg. 41, only the minus strand DNA probe hybridizes with frie isolated HCV 
genome. This result, in combination with the result showing that the genome rs sensitive to RNase and not 
DNase (See Section IV.C.2.). suggests that the genome of NANBV is positive stranded RNA. 
20 These data, and data from other laboratories concerning the physicochemical properties of a putative 
NANBV{s). are consistent with the possibility that HCV is a member of tiie Flaviviridae. However, tiie 
possibility that HCV represents a new class of viral agent has not been eliminated. 



25 IV.H.2. Detection of Sequences in Captured Particles 

Which When Amplified by PCR Hybridize to HCV cDNA Derived from Clone 81 

The RNA in captured particles was obtained as described In Section IV.H.1. The analysis for sequences 
which hybridize to the HCV cDNA derived from clone 81 was canried out utilizing the PCR amplification 
30 procedure, as described in Section IV.C.3, except tiiat the hybridization probe was a kinased oligonucleotide 
derived from tiie clone 81 cDNA sequence. The results showed that the amplified sequences hybridized 
with the clone 81 derived HCV cDNA probe. 

iV.H.3. Homology Between the Non-Stnjctural Protein of Dengue Ravivirus (MNWWVD1) and the HCV 
35 Polypeptides Encoded by the Combined ORF of Clones 141 Through 39c " 

The combined HCV cDNAs of clones 141 tiirough 39c contain one continuous ORF. as shown In Rg. 26. 
The polypeptide encoded tiierein was analyzed for sequence homology with the region of tiie non-structural 
polypeptide(s) in Dengue flavivirus (MNWVD1), The analysis used tiie Dayhoff protein data base, and was 

40 performed on a computer. The results are shown in Rg. 42. where the symbol (:) indicates an exact 
homology, and the symbol (.) indicates a conservative replacement in the sequence; the dashes Indicate 
spaces inserted into the sequence to achieve the greatest homologies. As seen from the figure, there is 
significant homology between the sequence encoded in tiie HCV cDNA, and tiie non-stinctural polypeptide- 
(s) of Dengue virus. In addition to the homology shown in Rg. 42, analysis of the polypeptide segment 

45 encoded in a region towards the 3 -end of the cDNA also contained sequences which are homologous to 
sequences in the Dengue polymerase. Of consequence is tiie finding that the canonical Gly-Asp-Asp (GDD) 
sequence tiiought to be essential for RNA-dependent RNA polymerases Is contained in the polypeptide 
encoded in HCV cDNA, in a location which is consistent with that in Dengue 2 virus. (Data not shown.) 

50 

IV.H.4. HCV-DNA is Not Detectable in NANBH Infected Tissue 

Two types of studies provide results suggesting that HCV-DNA is not detectable in tissue from an 
individual with NANBH. These results, in conjunction witii tiiose described in IV.C. and IV.H.1. and IV.H.2. 
55 provide evidence ttiat HCV is not a DNA containing virus, and that its replication does not involve cDNA. 



IV.H.4.a. Southern Blotting Procedure 
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In order to determine whether NANBH infected chimpanzee liver contains detectable HCV-DNA (or 
HCV-cDNA), restriction enzyme fragments of DNA isolated from this source was Southern blotted, and the 
blots probed with ^^p-iabeled HCV cDNA. The results showed that the labeled HCV cDNA did not hybridize 
to the blotted DNA from the infected chimpanzee liver. It also did not hybridize to control blotted DNA prom 
5 normal chimpanzee liver. In contrast, in a positive control, a labeled probe of the beta-interferon gene 
hybridized strongly to Southern biots of restriction enzyme digested human placental DNA. These systems 
were designed to detect a single copy of the gene which was to be detected with the labeled probe. 

DNAs were isolated from the livers of two chimpanzees with NANBH. Control DNAs were isolated from 
uninfected chimpanzee liver, and from human placentas.. The procedure for extracting DNA was essentially 
10 according to Maniatis et al. (1982), and the DNA samples were treated with RNAse during the isolation 
procedure. 

Each DNA sample was treated with either EcoR!, MboL or HinCII (12 micrograms), according to the 
manufacturer's directions. The digested DNAs were electrophoresed on 1% neutral agarose gels, Southern 
blotted onto nitrocellulose, and the blotted material hybridized with the appropriate nick-translated probe 
IS cDNA (3x10^ cpm/ml of hybridization mix). The DNA from infected chimpanzee liver and normal liver were 
hybridized with ^^p-iabeied HCV cDNA from clones 36 plus 81; the DNA from human placenta was 
hybridized with ^P-labeled DNA from the beta-interferon gene. After hybridization, the blots were washed 
under stringent conditions, i.e.. with a solution containing 0.1 x SBC, 0.1% SDS. at 65* C. 

The beta-interferon gene DNA was prepared as described by Houghton et al (1981). 

20 

IV.H.4.b. Amplification by the PCR Technique 

In order to determine whether HCV-DNA could be detected in liver from chimpanzees with NANBH. 

25 DNA was isolated from the tissue, and subjected to the PCR amplification-detection technique using 
primers and probe polynucleotides derived from HCV cDNA from clone 81. Negative controls were DNA 
samples isolated from uninfected HepG2 tissue culture cells, arid from presumably uninfected human 
placenta. Positive controls were samples of the negative control DNAs to which a known relatively small 
amount (250 molecules) of the HCV cDNA insert from clone 81 was added. 

30 in addition, to confirm that RNA fractions isolated from the same livers of chimpanzees with NANBH 
contained sequences complementary to the HCV-cDNA probe, the PCR amplification-detection system was 
also used on the isolated RNA samples. 

In the studies, the DNAs were isolated by the procedure described in Section IV.H.4.a. and RNAs were 
extracted essentially as described by Chirgwin et al. (1981). 

35 Samples of DNA were isolated from 2 infected chimpanzee livers, from uninfected HepG2 cells, and 
from human placenta. One microgram of each DNA was digested with Hindlll according to the manufac- 
turer's, directions. The digested samples were subjected to PCR amplification and detection for amplified 
HCV cDNA essentially as described in Section IV.C.3., except that the reverse transcriptase step was 
omitted. The PCR primers and probe were from HCV cDNA clone 81. and are described in Section IV.C.3.. 

40 Prior to the amplification, for positive controls, a one microgram sample of each DNA was "spiked" by the 
addition of 250 molecules of HCV cDNA insert isolated from clone 81. 

In order to determine whether HCV sequences were present in RNA isolated from the livers of 
chimpanzees with NANBH, samples containing 0.4 micrograms of total RNA were subjected to the 
amplification procedure essentially as described in Section IV.C.3., except that the reverse transcriptase 

45 was omitted from some of the samples as a negative control. The PCR primers and probe were from HCV 
cDNA clone 81. as described supra. 

The results showed that amplified sequences complementary to the HCV cDNA probe were not 
detectable In the DNAs from infected chimpanzee llvewr. nor were they detectable in the negative controls. In 
contrast, when the samples, including the DNA from infected chimpanzee liver, was spiked with the HCV 

60 cDNA prior to amplification, the clone 81 sequences were detected in all positive control samples* In 
addition, in the RNA studies, amplified HCV cDNA clone 81 sequences were detected only when reverse 
transcriptase was used, suggesting strongly that the results were not due to a DNA contamination. 

These results show that hepatocytes from chimpanzees with NANBH contain no, or undetectable levels, 
of HCV DNA. Based upon the spiking study, if HCV DNA is present, it is at a level far below .06 copies per 

55 hepatocyte. In contrast, the HCV sequences in total RNA from the same liver samples was readily detected 
with the PCR technique. 
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IV.I. ELISA Determinations for HCV Infection Using HCV c100-3 As Test Antigen 



All samples were assayed u^ng the HCV c100-3 ELISA. This assay utilizes the HCV c100-3 antigen 
{which was synthesized and purified as described in Section IV.B.5), and a horseradish peroxidase (HRP) 

5 conjugate of mouse monoclonal anti-human IgG. 

Plates coated with the HCV c100-3 antigen were prepared as follows. A solution containing Coating 
buffer (50mM Na Borate. pH 9.0). 21 ml/plate, BSA (25 micrograms/ml). clOO-3 (2.50 micrograms/ml) was ^ 
prepared just prior to addition to the Removeawell Immulon I plates (Dyhatech Corp.). After mixing for 5 
minutes. 0.2ml/well of the solution was added to the plates, they were covered and incubated for 2 hours at 

10 37* C. after which the solution was removed by aspiration. The wells were washed once with 400 microliters 
Wash B^er (100 mM sodium phosphate, pH 7.4, 140 mM sodium chloride, 0.1% (WAO casein, 1% (W/V) 
Triton x-100, 0.01% (WAO Thimerosai). After removal of the wash solution, 200 microltters/weH of Postcoat 
solution (10 mM sodium phosphate. pH 7.2, 150 mM sodium chloride, 0.1% (w/v) casein and 2 mM 
phenylmethylsulfonytfluoride (PMSF)) was added, the plates were loosely covered to prevent evaporation, 

15 and were allowed to stand at room temperature for 30 minutes^ The wells were then aspirated to remove 
the solution, and iyophiltzed dry ovemight, without shelf heating. The prepared plates may be stored at 2- 
8* C in sealed aiuminunn pouches. 

!n order to perform the EUSA determination, 20 microliters of serum sample or control sample was 
added to a well containing 200 microliters of sample diluent (100 mM sodium phosphate, pH 7.4. 500 mM 

20 sodium chloride, 1 mM EDTA, 0.1% (WA/) Casein, 0.015 (W/V) Therosal. 1% (WA/) Triton X-100. 100 
micrograms/ml yeast extract). The plates were sealed, and incubated at 37* C for two hours, after which the 
solution was removed by aspiration, and the wells were washed with 400 microliters of wash buffer 
(phosphate buffered saline (PBS) contaning 0.05% Tween 20). The washed wells were treated with 200 
microliters of mouse anti-human IgG-HRP conjugate contained in a solution of Ortho conjugate diluent (10 

25 mM sodium phosphate, pH 7.2. 150 mM sodium chloride, 50% (V/V) fetal bovine serum, 1% (V/V) heat 
treated horse serum. 1 mM K3Fe(CN)6. 0.05% (W/V) Tween 20. 0.02% (WA/) Thimerosai). Treatment was 
for 1 hour at 37* C. the solution was removed by aspiration, and the wells were washed with wash buffer, 
which was also removed by aspiration. To determine the amount of bound enzyme conjugate, 200 
microliters of substrate solution (10 mg 0-phenylenedlamine dihydrochloride per 5 ml of Developer solution) 

30 was added. Developer solution contains 50 mM sodium citrate adjusted to pH 5.1 with phosphoric acid, and 
0.6 microliters/ml of 30% H2O2. The plates containing the substrate solution were incubated in the dark for 
30 minutes at room temperature, the reactions were stopped by the addition of 50 microliters/ml 4N sulfuric 
acid, and the CDs determined. 

The examples provided below show that the microtiter plate screening ELISA which utilizes HCV c100-3 

35 antigen has a high degree of specificity, as evidenced by an initial rate of reactivity of about 1%, with a 
repeat reactive rate of about 0.5% on random donors. The assay is capable of detecting an im- 
munoresponse in both the post acute phase of the infection, and during the chronic phase of the disease. In 
addition, the assay is capable of detecting some samples which score negative in the surrogate tests for 
NANBH; these samples come frorn Individuals with a history of NANBH, or from donors implicated in 

40 NANBH transmission. 

In the examples described below, the folbwing abbreviations are used: 
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ALT 


Alanine amino transferase 


Anti-HBc 


Antibody against HBc 


Anti-HBsAg 


Antitjody against HBsAg 


HBc 


Hepatitis B core antigen 


ABsAg 


Hepatitis B surface antigen 


IqG 


Immunogtobulin G 


IgM 


Immunoglobuiin M 


lU/L 


International units/Liter 


NA 


Not available 


NT 


Not tested 


IN 


oampie size 


Neg 


Negative 


OD 


Optical density 


Pos 


Positive 


S/CO 


Signai/cutoff 


SD 


Standard deviation 


X 


Average or mean 


WNL 


Witfiin nomr^ai limits 



IV.L1 . HCV Infection in a Population of Random Blood [honors 

25 A group of 1.056 samples (fresh sera) from random blood donors were obtained from Irwin Memorial 
Blood Bank. San Francisco. Califomia. The test results obtained with these samples are summarized in a 
histogram showing the distribution of the OD values (Rg. 43). As seen in Fig. 43, 4 samples read >3. 1 
sample reads between 1 and 3, 5 samples read between 0.4 and 1. and the remaining 1,046 samples read 
<0.4. with over 90% of these samples reading <0.1. 

30 The results on the reactive random samples are presented in Table 5. Using a cut-off value equal to the 
mean plus 5 standard, deviations, ten samples out of the 1,056 (0.95%) were initially reactive. Of these, five 
samples (0.47%) repeated as reactive when they were assayed a second time using thei ELISA. Table 5 
also shows the ALT and Anti-HBd status for each of the repeatedly reactive samples. Of particular interest 
is the fact that all five repeat reactive samples were negative in both surrogate tests for NANBH. while 

35 scoring positive in the HCV EUSA, 
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TABLE 5 



75 



20 



RESULTS ON REACTIVE RANDOM SAMI*LES 


N = 1051 










5r= 0.049* 










SD = ± 0.074 








Cut-off: 5r+ 5SD = 0,419 (0.400 


+ Negative Control) 




Samples 


initial Reactlves OD 


Repeat Reactives 


ALT*" 


Anil HRr ^ 






OD 


(lU/L) 


(OD) 


4227 


0.462 


0.084 


NA 


NA 


6292 


0.569 


0.294 


NA 


NA 


6188 


0.699 


0.326 


NA 


NA 


6157 


0.735 


0.187 


NA 


NA 


6277 


0.883 


0.152 


NA 


NA 


6397 


1.567 


1.392 


30.14 


1.433 


6019 


>3.000 


>3.000 


46.48 


1.057 


6651 


>3.000 


>3.000 


48.53 


1.343 


6669 


>3.000 


>3.000 


60.53 


1.165 


4003 


>3.000 


3.000 


WNL*^ 


Negative 




10/1056 = 0.95% 


5/1056 = 0.47% 







• Samples reading >1 .5 were not included in calculating the Mean and SD 
25 - ALT S 68 lU/L is above normal limits. 

Anti-HBc ^ 0.535 (competition assay) is considered positive. 
™ WNL: Witfiin normal limits. 
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IV.L2. Chimpanzee Serum Samples 

Serum samples from eleven chimpanzees were tested with the HCV c100-3 ELISA. Four of these 
chimpanzees were infected with NANBH from a contaminated batch of Factor Vill (presumably Hutchinson 
strain), following an established procedure in a collaboration with Dr. Daniel Bradley at the Centers for 
Disease Control. As controls, four other chimpanzees were infected with HAV and three with HBV. Serum 
samples were obtained at different times after infection. 

The results, which are summarized In Table 6, show documented antibody seroconversion in all 
chimpanzees infected with the Hutchinson strain of NANBH. Following the acute phase of infection (as 
evidenced by the significant rise and subsequent return to normal of ALT levels), antibodies to HCV c100-3 
became detectable in the sera of the 4/4 NANBH infected chimpanzees. These samples had previously 
,^ been shown, as discussed In Section IV.B-3., to be positive by a Western analysis, and an RIA. In contrast 
none of the control chimpanzees which had been infected with HAV or HBV showed evidence of reactivity 
in the ELIS/V. 
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CHIMRftNZEE SERUM SAMPLES 



70 



NecAT I vt Control 
Positive Control 
Cutoff 



CD 



0*001 
1.501 
0.401 



s/co 



Imoculatioh 

OftTE 



Bleed 
Date 



ALT 
CIU/L) 



Tramsfused 



75 



zo 



25 



30 



35 



40 



Chimp 1 



Chimp 2 



Chimp 3 



Chimp 4 



Chimp 5 



Chimp 6 



Chimp 7 



45 



-0.007 
0.005 
>3.000 
>3.000 



-0.005 
-0.005 
0.915 
>3.000 

0.017 
0.006 
1.010 

-0.006 
0.003 
0.523 
U57n 

-0.005 
0.001 
0.003 
0.006 



-0.005 
O.OOi 

-0.004 
0.290 

-0.008 

-d.ooa 

-0.006 
0.005 



0.00 
O.OI 
>7.48 

>7.n8 



0.00 
0.00 
2.36 
>7.18 

0.01 

o.on 

0.01 
2.52 



0.00 
0.00 
0.01 
0.01 



0.00 
0.00 
0.00 
0.72 

0.00 
0.00 
0.00 
0.01 



05/24/84 



06/07/84 



03/14/85 



05/11/85 



11/21/80 



05/25/82 



05/25/82 



05/24/84 


9 


08/07/81 


71 


09/18/84 


19 


10/2A/84 




05/31/84 


5 


06/28/84 


52 


08/20/84 


15 


10/24/81 




03/14/85 


8 


04/25/85 


205 


05/05/85 


14 


08/20/85 


g 


03/11/85 


11 


05/09/85 


132 


05/06/85 




08/01/85 




11/21/80 


4 


12/15/80 


147 


12/30/80 


18 


07/29 - 08>71/81 


5 


05/17/82 




06/10/82 


lOG 


07/06/82 


10 


10/01/82 


05/2S/82 


7 


06/17/B2 


83 


09/16/82 


5 


lb/09>^ 





NANB 



HAf^B 



HAND 



NANB 



HAV 



HAV 



KAY 
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TSBLE b 
CHI^gAN2EE SERUM SRMPl£S 
(Canted) 



10 



IS 



20 



25 



30 



Chimp 8 



OP 

-0.007 
0.000 
0.004 
0.000 



Chimp 9 



s/co 

0.00 
0.00 
0.01 
0.00 



0.019 0.05 

0.015 o.on 

O.OOS 0.02 



Chimp 10 — — 

O.OU 0.05 

0.015 0.01 

0.008 0.02 

O.OiO 0.02 

Chimp 1 1 — — 

0.000 0.00 

*o.oa3 0.00 

-0.003 0.00 

-0.003 0.00 



[llOCULAriON 

II/21/S0 



07/24/80 



05/l2>^2 



05/12/82 



Bleed 
Date 



n/21/80 
12/16/80 
02/05/81 
0&/03 - 06/10/81 



08/22 - 10/10/79 
03/11/81 

07/01 - 08/05/81 
10/01/81 



04/21 - 05/12/82 
09/01 - 09/08/82 
12/02/82 
01/06/83 



01/00 - 05/12/82 
06/23/82 

06/09 - 07/07/82 
10/28/82 
12/20/82 



ALT 
(fU/L) 

15 
130 
8 

4.5 



57 
9 
6 



9 
126 
9 
13 



11 
100 

9 
10 



Transfused 
IIAV 

HBV 



IIDV 



HBV 
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r/.L3. Panel 1: Proven Infectious Sera frpm Chronic Human NANBH Carriers 

A coded panel consisted of 22 unique samples, each one in duplicate, for a total of 44 samples. The 
samples were from proven infectious sera from chronic NANBH carriers, infectious sera from implicated 
40 donors, and infectious sera from acute phase NANBH patients. In addition, the samples were from highly 
pedigreed negative controls, and other disease controls. This panel was provided by Dr. H. Alter of the 
Department of Health and Human Services, National Institutes of Health. Bethesda. Maryland, The panel 
^ was constructed by Dr. Alter several years ago, and has been used by Dr. Alter as a qualifying panel for 
putative NANBH assays. 

45 The entire panel was assayed twice with the EUSA assay, and the results were sent to Dr. Alter to be 
scored. The results of the scoring are shown In Table 7. Although the Table reports the results of only one 
set of duplicates, the same values were obtained for each of the duplicate samples. 

As shown in Table 7, 6 sera which were proven infectious in a chimpanzee model were strongly 
positive. The seventh infectious serum corresponded to a sample for an acute NANBH case, and was not 

50 reactive in this ELISA. A sample from an implicated donor with both normal ALT levels and equivocal 
results in the chimpanzee studies was non-reactive in the assay. Three other serial samples from one 
individual with acute NANBH were also non-reactive. All samples coming from the highly pedigreed 
negative controls, obtained from donors who had at least 10 blood donations withoiit hepatitis impiication. 
were non-reactive in the EUSA. Finally, four of the samples tested had previously scored as positive in 

55 putative NANBH assays developed by others, but these assays were not conftrmable. These four samples 
scored negatively with the HCV ELISA. 
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TABLE 7 



30 



H.ALTER'S PANEL 1: 


PANEL 


1ST 


2ND 




RESULT 


RESULT 


1) PROVEN INFECTIOUS BY CHIIVIPAN2EE 


TRANSMISSION 






A. CHRONIC NANB: POST-TX 


JF 


+. 


+ 


EE 


+ 


+ 


PG 


+ 


+ 


B. IMPUCATED DONORS WITH ELEVATED 


ALT 






BC 


+ 


+ 


JJ 




+ 


BB 


+ 




C. ACUTE NANB: POST-TX 


WH 






2) EQUIVOCALLY INFECTIOUS BY 




CHIMPANZEE TRANSMISSION 




A. IMPLICATED DONOR WIN NORMAL ALT 


CC 






3) ACUTE NANB: POST-TX 


JL \A^EK 1 






JL V\^EK2 






JL V\^EK3 






4) DISEASE CONTROLS 


A. PRIMARY BILIARY CIRRHOSIS 


EK 






B. ALCOHOLIC HEPATITIS IN RECOVERY 


HB 






5) PEDIGREED NEGATIVE CONTROLS 


DH 






DC 






LV. 






HL 






AH 






6) POTENTIAL NANB "ANTIGENS" 


JS-80-OIT-0 (ISHIDA) 






ASTERIX (TREPO) 






ZURTZ (ARNOLD) 






BECASSDINE (TREPO) 







55 

IVJ.4. Panel 2: Donor/Recipient NANBH 
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The coded panel consisted of 10 unequivocal donor-recipient cases of transfusion associated NANBH» 
witli a tot^ of 188 samples. Ea,ch case consisted of samples of some or all the donors to the recipient, and 
of serial samples (drawn 3. 6. and 12 months after transfusion) from the recipient Also included was a pre- 
bleed. drawn from the recipient before transfusion. The coded panel was provided by Dr. H. Alter, from the 

5 NIH. and the results were sent to him for scoring. 

The results, which are summarized In Table 8, show that the EUSA detected antibody seroconversion 
in 9 of 10 cases of transfusion associated NANBH. Samples from case 4 (where no seroconversion was 
detected), consistently reacted poorly in the ELISA. Two of the 10 recipient samples were reactive at 3 
months post transfusion. At six months. 8 recipient samples were reactive; and at twelve months, with the 

10 exception of case 4, all samples were reactive. In addition, at least one antibody positive donor was found in 
7 out of the 10 cases, with case 10 having two positive donors. Also, in case 10, the recipient's pre-bleed 
was positive for HCV antibodies. The one month bleed from this recipient dropped to borderline reactive 
levels, while it was elevated to positive at 4 and 10 month bleeds. Generally, a S/CO of 0.4 is considered 
positive. Thus, this case may represent a prior infection of the individual with HCV. 

15 The ALT and HBc status for ail the reactive, i.e., positive, samples are summarized in Table 9. As seen 
in the table. 1/8 donor samples was negative for frie surrogate markers and reactive in the HCV antibody 
ELISA. On the other hand, the recipient samples (followed up to 12 months after transfusion) had either 
elevated ALT; positive Anti-HBc. or both. 

20 TABLE 8 



DONOR/RECIPIENT NANB PANEL 



H. ALTER DONORmECIPIENT NANB PANEL 



CASE 


DONOR 


RECIPIENT 
PREBLEED 


POST-TX 


OD 


S/CO 


00 


S/CO 


3 MONTHS 


6 MONTHS 


12 MONTHS- 


OD 


S/CO 


OD 


S/CO 


OD 


S/CO 


1. 






.032 


0.07 


,112 


0,26 


>3.000 


>6.96 


>3,000 


>6.96 


2, 






.059 


0.14 


.050 


0.12 


1.681 


3.90 


>3,000 


>6.96 


3. 


.403 


0.94 


.049 


0.11 


\057 


0.13 


>3.000 


>6.96 


>3.000 


>6.96 


4. 






,065 


0.15 


.073 


0.17 


.067 


0.16 


.217 


0.50 


5. 


>3.000 


>6.96 


.034 


0.08 


.096 


0.22 


>3.000 


>6.96 


>3.000 


>6.96 


6. 


>3.000 


>6.96 


.056 


0.13 


1.475 


3.44 


>3.000 


>6.96 


>3.000 


>6.96 


7. 


>3.000 


>6.96 


.034 


0.08 


.056 


0.13 


>3.000 


>6.96 


>3.000 


>6.96 


8- 


>3.000 


>6.96 


.061 


0.14 


.078 


0.18 


2.262 


5,28 


>3.000 


>6.96 


9. 


>3.000 


>6.96 


.080 


0.19 


.127 


0.30 


.055 


0.13 


>3.000 


>6.96 


10. 


>3.000 


>6.96 


>3.000 


>6.96 


.317" 


0.74 


>3.000** 


>6.96 


>3.000'* 


>6.96 




>3.000 


>6.96 



















45 



* 1 MONTH. 

4 MONTHS. 
~*10 MONTHS 



50 
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TABLE 9 





ALT AND HBc STATUS FOR REACTIVE SAMPLES IN 


5 




K ALTlER PANEL 1 






Samples 


Anti-ALT 


HBc** 




Donors 




Cass 3 




Normal 


Negative 


70 


Case 5 




Elevated 


Positive 




Case 6 




Elevated 


Positive 




Case 7 




Not available 


Negative 




Case 8 




Normal 


Positive 




Case 9 




Elevated 


Not available 


15 


Case 10 




Normal 


Positive 




Case 10 




Normal 


Positive 




Recipients 




Case 1 


6 mo 


Elevated 


Positive 


20 


12 mo 


Elevated 


Not tested 






Case 2 


6 mo 


Elevated 


Negative 




12 mo 


Elevated 


Not tested 






Case 3 


6 mo 


Normal 


Not tested*" 




12 mo 


Elevated 


Not tested^ 




25 


Case 5 


6 mo 


Elevated 


Not tested 




12 mo 


Elevated 


Not tested 






Case 6 


3 mo 


Elevated 


Negative 




6 mo 


Bevated 


Negative 






12 mo 


Elevated 


Not tested 




30 


Case 7 


6 mo 


Elevated 


Negative 




12 mo 


Elevated 


Negative 






Case 8 


6 mo 


Normal 


Positive 




12 mo 


Elevated 


Not tested 






Case 9 


12 mo 


Elevated 


Not tested 


35 


Case 10 


4 mo 


Elevated 


Not tested 




10 mo 


Elevated 


Not tested 





* ALT ^45 lU/L is above normal limits. 

"* Anti-HBc ^50% (competition assay) is considered positive. 

*** Prebleed and 3 mo samples were negative for HBc. 



IV.L5. Determination of HCV infection in High Risk Group Samples 

45 

Samples from high risk groups were monitored using the ELISA to determine reactivity to HCV cl00-3 
antigen. These samples were obtained from Dr. Gary Tegtmeier, Community Blood Bank. Kansas City. The 
results are summarized in Table 10. 

As shown in the table, the samples with the highest reactivity are obtained from hemophiliacs (76%). In 
50 addition, samples. from individuals with elevated ALT and positive for Anti-HBc. scored 51% reactive, a 
value which is consistent with the value expected from clinical data and NANBH prevalence in this group. 
The incidence of antibody to HCV was also higher in blood donors with elevated ALT alone, blotdd donors 
positive for antibodies to Hepatitis B core alone, and in blood donors rejected for reasons other than high 
ALT or anti-core antibody when compared to random volunteer donors. 

55 
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TABLE 10 



NANBH HIGH RISK GROUP SAMPLES 


Group 


N 


Distribution 


% Reactive 




N 


00 


Elevated ALT 


35 


3 


> 3.000 


11.4% 


1 


0.728 








Anti-HBc 


24 


c 




20.8% 


Elevated ALT. Anti-HBc 


33 


f ^ 




51.5% 


1 


2.768 








1 


2.324 








1 


0.939 








1 


0.951 








1 


0.906 








Rejected Donors 


25 


5 


>3.000 


20.0% 


Donors with History of Hepatitis 


150 


19 


> 3.000 


14.7% 


1 


0.837 








1 


0.714 








1 


0.469 








Haemophiliacs 


50 


31 


>3.000 


76.0% 


1 


2.568 








1 


2.483 








1 


2.000 








1 


1.979 








1 


1.495 








1 


1209 








1 


0.819 









IV.L6 Comparative Studies Using Anti^igG or Anti-lgM Monoclonal Antibodies, or Potyclonal Antibodies as a 
Second Antibody Tn Ti^HC^^cT^^ 

The sensitivity of the EUSA determination which uses the anti-IgG rhonoclonal conjugate was compared 
to that obtained by using either an anti-IgM monoclonal conjugate, or by replacing both with a polyclonal 
antiserum reported to be both heavy and light chain specified. The following studies were perfonmed. 

40 

IV.I.6.a. Serial Samples from Seroconverters 

Serial samples from three cases of NANB seroconverters were studied in the HCV c100-3 ELISA assay 
using in the enzyme conjugate either the anti-lgQ monoclonal aione» or in combination with an anti-IgM 
monoclonal, or using a polyclonal antiserum. The samples were provided by Dr. Cladd Stevens, N.Y. Blood 
Ceriter, N.Y.C., N.Y.. The sample histories are shown in Table 11. ^ 

The results obtained using an anti-IgG monoclonal antibody-enzyme conjugate are shown in Table 12. 
The data shows that strong reactivity Is initially detected in samples 1-4, 2-8, and 3-5, of cases 1» 2, and 3, 
respectively. 

The results obtained using a combination of an anti-IgG mortoctonal conjugate and an anti-IgM 
conjugate are shown in Table 13. Three different ratios of anti-tgG to anti-IgM were tested; the 1:10.000 
dilution of anti-IgG was constant throughout Dilutions tested for ttie anti-IgM monoclonal conjugate were 
1:30,000, 1:60.000. and 1:120,000. The data showis tiiat. in agreement with the studies with anti-IgG alone, 
initial strong reactivity is detected in samples 1-4. 2-8. and 3-5. 

The results obtained with the ELISA using anti-IgG monoclonal conjugate (1:10,000 dilution), or Tago 
polydonal* conjugate (1:80,000 dilution), or Jackson polyclonal conjugate (1:80,000 dilution) are shown in 
Table 14, The data Indicates that initial strong reactivity is detected in samples 1-4, 2-8, and 3-5 using all 
three configurations; tine Tago polyclonal antibodies yielded the lowest signals. 
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The results presented above show that all three configurations detect reactive samples at the same time 
after the acute phase of the disease (as evidenced by the ALT elevation). Moreover, the results indicate that 
the sensitivity of the HCV c100-3 ELISA using anti-IgG monoclonal-enzyme conjugate is equal to or better 
than that obtained using the other tested configurations for the ehzynrie conjugate. 

TABLE 11 

DESCRIPriON OF SAMPLES FROM CLADD STEVENS PANEL 



75 



20 



25 



30 



35 



40 



0*c t H>»A< Ahtt»HB« 

3-T e/5/ai 1.0 91.7 

10/7/81 i.O U.O 

1-4 U/mai KO «7,3 

1-5 l2/15/ai 1.0 50.5 

Cat* 2 



3-1 
3-2 

45 3-3 
3-4 
3-5 
3-6 

50 3-7 
3-8 
3-9 



55 



L-HBe 


ALT 


Bi L iruDin 


12.9 


40.0 


-1.0 


IS.l 


274.0 


I. A 


23.8 


261.0 


0,9 


33.8 


75.0 


0.9 


27.6 


^71.0 


1.0 



2-1 


10/19/81 


1.0 


1.0 


116.2 


17.0 


-1.0 


2-2 


a/17/81 


1.0 


0.8 


89.3 


46.0 


1. 1 


2-3 


12/02/81 


1.0 


1.2 


78.3 


63.0 


1.4 


2-^ 


12/14/81 


1.0 


0.9 


90.6 


152.0 


1.4 


2-5 


12/23/81 


1.0 


0.8 


93.6 


624.0 


1.7 


2-6 


1/20/82 


1.0 


0.8 


92.9 


66.0 


1.5 


2-7 


2/15/82 


1.0 


0.8 


86.7 


70.0 


1.3 


2-8 


3/17/82 


1.0 


0.9 


69.8 


24.0 


-1.0 


2-9 


4/21/82 


1.0 


0.9 


67.1 


S3.0 


1.5 


2-10 


3/19/82 


1.0 


0.5 


74.8 


95.0 


1.6 


2-11 


4/14/82 


1.0 


0.8 


82.9 


37.0 


-1.0 



4/7/81 


1.0 


1.2 


68.4 


13.0 


-1.0 


5/12/81 


1*0 


1.1 


126.2 


' 236.0 


0.4 


5/30/81 


1.0 


0.7 


99.9 


471.0 


0.2 


4/9/81 


1.0 


1.2 


110.8 


315.0 


0.4 


7/6/81 


UO 


Ul 


89.9 


273.0 


0.4 


1/10/81 


1.0 


1.0 


118.2 


158.0 


0.4 


9/8/81 


1.0 


1.0 


112.3 


84.0 


0.3 


10/14/81 


1.0 


0.9 


102.5 


180.0 


0.5 


11/11/81 


1.0 


1.0 


84.6 


154.0 


0.3 
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TABLE 12 



ELISA RESULTS OBTAINED USING AN ANTI-IgG 
MONOCLONAL CONJUGATE 



SAMPLE 


DATE 


ALT 


OD 


S/CO 


NEG CONTROL 






.076 




CUTOFF 






.476 




PC (1:128) 






1.390 




CASE#1 


1-1 


08/05/81 


40.0 


.178 


.37 


1-2 


09702/81 


274.0 


.154 


.32 


1-3 


10/07/81 


261.0 


.129 


.27 


1-4 


11/19/81 


75.0 


.937 


1.97 


1-5 


12/15/81 


71.0 


>3.000 


>6.30 


CASE #2 


2-1 


^10/19/81 


17.0 


.058 


0.12 


2-2 


11/17/81 


46.0 


.050 


0.11 


2-3 


12/02/81 


63.0 


.047 


0.10 


2-4 


12/14/81 


152.0 


.059 


0.12 


2-5 


12/23/81 


624.0 


.070 


0.15 


2-6 


01/20/82 


66.0 


.051 


0.11 


2-7 


02/15/82 


70.0 


.139 


0,29 


2-8 


03/17/82 


24.0 


1.867 


3.92 


2-9 


04/21/82 


53.0 


>3.000 


>6.30 


2-10 


05/19/82 


95.0 


>3.000 


>6.30 


2-11 


08/14/82 


37.0 


>3.000 


>6.30 


CASE #3 


3-1 


04/07/81 


13.0 


.090 


.19 


3-2 


05/12)81 


236.0 


.064 


.13 


3-3 


05/30/81 


471.0 


.079 


.17 


3-4 


06/09/81 


315.0 


.211 


.44 


3-5 


07/06/81 


273.0 


1.707 


3.59 


3-6 


08/10/81 


158.0 


>3.000 


>6.30 


3-7 


09/08/81 


84.0 


>3.000 


>6.30 


3-8 


10/14/81 


180.0 


>3.000 


>6.30 


3-9 


11/11/81 


154.0 


>3.000 


>6.30 
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TABLE 13 



30 



ELISA RESULTS OBTAINED USING ANTI-IgG and ANTI-IgM MONOCLONAL CONJUGATE 


SAMPLE 


DATE 


ALT 


NANBELISAS 








MONOCLONALS 


MONOCLONALS 


MONOCLONALS IGG 








IGG 1:10k IGM 


IGG 1:1 OK IGM 


1:10K IGM 1:120K 








1:3dk 




1:60K 














OD 


S/CO 


OD 


S/CO 


OD 


S/CO 


NEG CONTROL 






.100 




.080 




.079 




CUTOFF 


















PC (1:128) 






1.083 




1.328 




1.197 




CASE#1 


1-1 


08/05/81 


40 


.173 




.162 




.070 




1-2 


09/02/81 


274 


.194 




.141 




.079 




1-3 


10/07/81 


261 


.162 




.129 




.063 




1-4 


11/19/81 


75 


.812 




.85 




.709 




1-5 


12/15/81 


71 


>3.0d 




>3.00 




>3.00 




Case #2 


2-1 


10/19/81 


17 


.442 




.045 




.085 




2-2 


11/17/81 


46 


.102 




.029 




.030 




2-3 


12/02/81 


63 


.059 




.036 




.027 




2-4 


12/14/81 


152 


.065 




.041 




.025 




2-5 


12/23/81 


624 


.082 




.033 




.032 




2-6 


01/20/82 


66 


.102 




.042 




.027 




2-7 


02/15/82 


70 


.188 




.068 




.096 




2-8 


03/17/82 


24 


1.728 




1.668 




1.541 




2-9 


04^^1/82 


53 


>3.00 




2.443 




>3.00 




2-10 


05/19/82 


95 


>3.00 




>3.00 




>3.00 




2-11 


06/14/82 


37 


>3.00 




>3.00 




>3.00 




CASE #3 


3-1 


04V07/81 


13 


-193 




.076 




.049 




3-2 


05/12/81 


236 


.201 




.051 




.038 




3-3 


05/30/81 


471 


.132 




.067 




.052 




3-4 


06/09/81 


315 


.175 




.155 




.140 




3-5 


07/06/81 


273 


1.335 




1.238 




1.260 




3-6 


08/10/81 


158 


>3.00 




>3.00 




>3.00 




3-7 


09/08/81 


84 


>3.00 




>3.00 




>3.00 




3-8 


10/14/81 


180 


>3.00 




>3.00 




>3.00 




3-9 


11/11/81 


154 


>3.00 




>3.06 




>3.00 
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TABLE 14 



5 



10 



15 



25 



30 



35 



40 



EUSA RESULTS OBTAINED USING POLYCLONAL CONJUGATES 


SAMPLE 


DATE 


ALT 


NANB EUSAS 








MONOCLONAL 1:1 OK 


TAGO1:80K 


JACKSON 1:80K 








OD 


S/CO 


OD 


S/CO 


OD 


S/CO 


NEG CONTROL 






.076 




.045 




.154 




CUTOFF 






.476 




.545 




.654 




PC (1:128) 






1.390 




.727 




2.154 




CASE #1 


1-1 


08/05/81 


40 


.178 


.37 


.067 


.12 


.153 


.23 


1-2 


09/02/81 


274 


.154 


.32 


.097 


.18 


.225 


.34 


1-3 


10/07/81 


261 


.129 


.27 


.026 


.05 


.167 


.26 


1-4 


11/19/81 


75 


.937 


1.97 


.324 


.60 


.793 


1.21 


1-5 


12/15/81 


71 


>3.00 


>6.30 


1.778 


3.27 


>3.00 


>4.59 


CASE #2 


2-1 


10/19/81 


17 


.058 


.12 


.023 


.04 


.052 


.08 


2-2 


11/17/81 


46 


.050 


.11 


.018 


.03 


.058 


.09 


2-3 


12/02/81 


63 


.047 


.10 


.020 


.04 


.060 


.09 


2-4 


12/14/81 


152 


.059 


.12 


.025 


.05 


.054 


.08 


2-5 


12/23/81 


624 


.070 


.15 


.026 


.05 


.074 


.11 


2-6 


01/20/82 


66 


.051 


.11 


.018 


.03 


.058 


.09 


2-7 


02/15/82 


70 


.139 


.29 


.037 


.07 


.146 


.22 


2-8 


03/17/82 


24 


1.867 


3.92 


.355 


.65 


1.429 


2.19 


2-9 . 


04/21/82 


53 


>3.00 


>6.30 


.748 


1.37 


>3.00 


>4.59 


2-10 


05/19/82 


95 


>3.00 


>6.30 


1.025 


1.88 


>3.00 


>4.59 


2-11 


06/14/82 


37, 


>3.00 


>6.30 


.917 


1.68 


>3.00 


>4.59 


Case #3 


3-1 


04/07/81 


13 


.090 


.19 


.049 


.09 


.138 


.21 


3-2 


05/12/81 


236 


.064 


.13 


.040 


.07 


.094 


.14 


3-3 


05/30/81 


471 


.079 


.17 


.045 


.08 


.144 


22 


3-4 


06/09/81 


315 


.211 


.44 


.085 


.16 


.275 


.42 


3-5 


07/06/81 


273 


1.707 


3.59 


.272 


.50 


1.773 


2.71 


3-6 


08/10/81 


158 


>3.00 


>6.30 


1.347 


2.47 


>3.00 


>4.59 


3-7 


09/08/81 


84 


>3.00 


>6.30 


2.294 


4.21 


>3.00 


>4.59 


3-8 


10/14/81 


180 


>3.00 


>6.30 


>3.00 


>5.50 


>3.00 


>4.59 


3-9 


11/11/81 


154 


>3.00 


>6.30 


>3.00 


>5.50 


>3.00 


>4.59 



IV.L6.b. Samples from Random Blood Donors 

Samples from random blood donors (See Section IV.L1.) were screened for HCV infection using the 
50 HCV clOO-3 ELISA. in wi:iich the antibody-enzyme conjugate was either an anti-IgG monoclonal conjugate, 
or a polyclonal conjugate. The total number of samples screened were 1077 and 1056. for the polyclonal 
conjugate and the monoclonal conjugate, respectively. A summary of the results of the screening is shown 
in Table 15. and the sample distributions are shown in the histogram in Rg. 44. 

The calculation of the average and standard deviation was performed excluding samples that gave a 
55 signal over 1.5, i.e.. 1073 OD values were used for the calculations utilizing the polyclonal conjugate, and 
1051 for the anti-lgG monoclonal conjugate. As seen in Table 15. when the polyclonal conjugate was used, 
the average was shifted from 0.0493 to 0.0931. and the standard deviation was increased from 0.074 to 
0.0933. [Moreover, the results also show that if the criteria of x +5SD is employed to define the assay 
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cutoff, the polycional-enzyme conjugate configuration in tine ELISA requires a higlier cutoff value. This 
indicates a reduced assay specificity as cornpared to the monocionai system. In addition, as depicted in the 
histogram in Rg. 44, a greater separation of results between negative and positive distributions occurs when 
random blood donors are screened in an EUSA using the anti-IgG monoclonaJ conjugate as compared to 
5 the assay using a conrimercial polyclonal label 

TABLE 15 



COMPARISON OF TWO EUSA CONFIGURATIONS IN 
TESTING SAMPLES FROM RANDOM BLOOD DONORS 


CONJUGATE 


POLYCLONAL 
(Jackson) 


ANTI-IgG 
MONOCLONAL 


Number of samples 
Average (x) 

Standard deviation (SD) 
5SD 

CUT-OFF (5 SD + x) 


1073 
0.0931 
0.0933 
0.4666 
0.5596 


1051 

0,04926 
0.07427 
0.3714 
0.4206 



IV.J. Detection of HCV Seroconversion in NANBH Patients from a Variety of Geographical Locations 

25 Sera from patients who were suspected to have NANBH based upon elevated ALT levels, and who 
were negative in HAV and HBV tests were screened using the RIA essentially as described in Section IV.D.. 
except that the HCV 01 00-3 antigen was used as the screening antigen in the microtiter plates. As seen 
from the results presented in Table 16. the RIA detected positive samples in a high percentage of the 
cases. 

30 

Table 16 



Seroconversion Frequencies for Anti-c100-3 


Among NANBH Patients in Different Countries 


Country 


The 


Italy 


Japan 




Netherlands 






No. Examined 


5 


36 


26 


No. Positive 


3 


29 


19 


% Positive 


60 


80 


73 



45 NX, Detection of HCV Seroconversion in Patients with "Community Acquired" NANBH 

Sera which was obtained from 100 patients witii NANBH, for whom tiiere was no obvious transmission 
route {i.e.. no transfusions, i.v. dmg use, promiscuity, etc. were identified as risk factors), was provided by 
Dr. M. Alter of the Center for Disease Control, and Dr. J. Dienstag of Harvard University. These samples 

so were screened using an RIA essentially as described in Section IV.D.. except tiiat tiie HCV c100-3 antigen 
was used as tiie screening antigen attached to the microtiter plates. The results showed tiiat of tiie 100 
serum samples. 55 contained antibodies tiiat reacted Immunologically with the HCV c100-3 antigen. 

The results described above suggest tiiat "Community Acquired" NANBH is also caused by HCV. 
Moreover, since it has been demonstrated herein tiiat HCV is related to Flaviviruses, most of which are 

55 transmitted by arthropods, it is suggestive that HCV transmission in the "Community Acquired" cases also 
results frorh arthropod transmission. 
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IV.L Comparison of Incidence pf HCV Antibodies and Surrogate Markers iri Donors Implicated \t\ NANBH 
Transmission 

A prospective study was carried out to determine whether recipients of blood from suspected NANBH 

5 positive donors, who developed NANBH, seroconverted to anti-HCV-antibody positive. The blood donors 
were tested for the sun-ogate marker abnormalities which are currently used as markers for NANBH 
infection, i.e., elevated ALT levels, and the presence of anti-core antibody. In addition, the donors were also 
tested for the presence of anti-HCV antibodies. The determination of the presence of anti-HCV antibodies 
was determined using a radioimmunoassay as described in Section IV.K. The results of the study are 

10 presented in Table 17, which shows: the patient number (column 1); the presence of anti-HCV antibodies in 
patient serum (column 2); the number of donations received by the patient, with each donation being from a 
different donor (column 3); tiie presence of anti-HCV antibodies in donor serum (column 4); and the 
surrogate abnonnality of the donor (column 5) (NT or - means not tested) (ALT is elevated transaminase, 
and ANTI-HBc is anti-core antibody). 

15 The results in Table 17 demonstrate that tiie HCV antibody test is more accurate in detecting infected 
blood donors tiian are the sunrogate marker tests. Nine out of ten patients who developed NANBH 
symptoms tested positive for anti-HCV antibody seroconversion. Of the 11 suspected donors, (patient 6 
received donations from two different individuals suspected of being NANBH earners). 9 were positive for 
anti-HCV antibodies, and 1 was borderline positive, and tiierefore equivocal (donor for patient 1). In contrast, 

20 using the elevated ALT test 6 of the ten donors tested negative, and using the anticore-antibody test 5 of 
the ten donors tested negative. Of greater consequence, though, in three cases (donors to patients 8, 9. and 
10) tiie ALT test and tiie ANTI-HBc test yielded inconsistent results. 

Table 17 

25 



DEVELOPMENT OF ANTI-HCV ANTIBODIES IN PATIENTS RECEIVING BLOOD 




FROM DONORS SUSPECTED OF BEING NANBH CARRIERS 




Patient 


Anti-HCV 


No. of 


Anti-HCV - 


Surrogate 




Seroconversion in 


Donations/Donors 


Positive Donor 


Abnormality 




Patient 


















Alt 


Anti-HB 


1 


yes 


18 


equiv 


no 


no 


2 


yes 


18 


yes 


NT 


yes 


3 


yes 


13 


yes 


no 


no 


4 


no 


18 


no 






5 


yes 


16 


yes 


yes 


yes 


6 


yes 


11 


yes(2) 


no 


no 










yes 


yes 


7 


yes 


15 


yes 


NT 


no 


8 


yes 


20 


yes 


no 


yes 


9 


yes 


5 


yes 


yes 


no 


10 


yes 


15 


yes 


no 


yes 



•Same donor as anti-NANBV Positive. 



50 

IV.M. Amplification for Cloning of HCV cDNA Sequences Utilizing the PGR and Primers Derived from 
Conserved Regions o TFIavivtrus Genomic Sequences 

The results presented supra., which suggest tiiat HCV is a fiavivirus or flavi-like virus, allows a strategy 
55 for cloning uncharacterized HCV cDNA sequences utilizing tiie PCR technique, and primers derived from 
the regions encoding conserved amino acid sequences in flaviviruses. Generally, one of the primers is 
derived from a defined HCV genomic sequence, and the otiier primer which flanks a region of unsequenced 
HCV polynucleotide is derived from a conserved region of tine fiavivirus genome. The fiavivirus genomes 
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are known to contain conserved sequences within tiie NS1, and E polypeptides, wiiich are encoded in the 
s'-region of the ilavivirus genome. Corresponding sequences encoding these regions lie upstream of the 
HCV cDNA sequence shown in Rg. 26. Thus, to isolate cDNA sequences derived from this region of the 
HCV genome, upstream primers are designed which are derived from the conserved sequences within 

5 these flavivirus polypeptides. The downstream primers are derived from an upstream end of the known 
portion of the HCV cDNA. - 

Because of the degeneracy of the code, it is probable that there will be mismatches between the 
flavivirus probes and the corresponding HCV genomic sequence. Therefore a strategy which is similar to 
the one described by Lee (1988) is used. The Lee procedure utilizes mixed oligonucleotide primers 

70 complementary to the reverse translation products of an amino acid sequence; the sequences in the mixed 
primers takes Into account every codon degeneracy for the conserved amino acid sequence. 

Three sets of primer mixes are generated^ based on the amino acid homologies found in several 
flaviviruses. including Dengue-2,4 (D-2,4), Japanese Encephalitis Virus (JEV), Yellow Fever (YF). and West 
Nile Virus (WN). The primer mixture derived, from the most upstream conserved sequence (s'-l), is based 

T6 upon the amino acid sequence gly-trp-gly, which is part of the conserved sequence asp-arg-gly-trp-gly- 
aspN found in the E protein of D-2, JEV, YF. and WN. The next primer mixture (5'-2) is based upon a 
downstream conserved sequence in E protein, phe-asp-gly-asp-ser-tyr-ileu-phe-gly-asp-ser-tyr-ileu, and is 
derived from phe-gly-asp; the conserved sequence is present in D-2. JEV, YF, and WN. The third primer 
mixture (5 -3). is based on the amino acid sequence arg-ser-cys, which is part of the conserved sequence 

20 cys-cys-arg-serncys in the NS1 protein of D-2, D-4, JEV. YF, and WN. The individual primers which form 
the mixture in 5'-3 are shown in Rg. 45. In addition to the varied sequences derived from conserved region, 
each primer in each mixture also contains a constant region at the s'-end which contains a sequence 
encoding sites for restriction enzymes. Hindlll. Mbo^ and EcoRI. 

The downstream primer, ssc5h20A, is derived fronrt a nucleotide sequence in clone 5h, which contains 

25 HCV cDNA with sequences with overiap those in clones 14i and 11b. The sequence of ssc5h20A is 

5' GTA ATA TGG TGA CAG AGT CA 3'. 

An alternative primer. ssc5h34A, may also be used. This primer is derived from a sequence in clone 5h. 
30 and in addition contains nucleotides at the s'-end which create a restriction enzyme site, thus facilitating 
cloning. The sequence of ssc5h34A is 

5' GAT CTC TAG AGA AAT CAA TAT GGT GAC AGA GTC A S'. 

35 The PCR reaction, which was initially described by Salki et al. (1986). is canled out essentially as 
described in Lee et al. (1988). except that the template for the cDNA is RNA isolated from HCV infected 
chimpanzee liver, as described in Section IV.C.2.. or from viral particles isolated from HCV infected 
chimpanzee serum, as described in Section IVA.1. In addition, the annealing conditions are less stringent in 
the first round of amplification (0.6M NaCI. and 25* C), since the part of the primer which will anneal to the 

40 HCV sequence is only 9 nucleotides, and there could be mismatches. Moreover, if ssc5h34A is used, the 
additional sequences not derived from the HCV genome tend to destabilize the primer-template hybrid. 
After the^ first round of amplification, the annealing conditions can be more stringent (0.066M NaCI, and 

^ 32 C-37 C), since the amplified sequences now contain regions which are complementary to, or duplicates 
of the primers. In addition, the first 10 cycles of amplification are run with Klenow enzyme I. under 

45 appropriate PCR conditions for that enzyme. After the completion of these cycles, the samples are 
extracted, and run with Tag polymerase, according to kit directions, as furnished by Cetus/Perkin-Elmer. 

After the amplification, the amplified HCV cDNA sequences are detected by hybridization using a probe 
derived from clone 5h. This probe Is derived from sequences upstream of those used to derive the primer, 
and does not overlap the sequences of the ck)ne 5h derived primers. The sequence of the probe Is 

so 

5 CCC AGC GGC GtA CGC GCT GC3A CAC GGA GGT GGC CGC GTC GTG TGG GGG TGT TGT TCT 
CGT CGG GTT GAT GGC GC 3'. 



55 

IV.N.1. Creation of HCV cDNA Library from liver of a Chimpanzee with Infectious NANBH 

An HCV cDNA library was created from liver fix)rn the chimpanzee from which tiie HCV cDNA library in 
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Section IVA.1. was created. The technique for creating the library was similar to that in Section IVA24, 
except for this different source of the RNA, and that a primer based on the sequence of HCV cDNA in clone 
lib was used. The sequence of the primer was 

5 CTG GCT TGA AGA ATC 3. 



iy,N.2. Isolation and nucleotide sequence of overlapping HCV cDNA in clone k9-1 to cDNA in clone lib 

Clone k9-1 was isolated from the HCV cDNA library created from the liver of an NANBH infected 
chimpanzee, as described in Section IV.A.25. The library was screened for clones which overlap the 
sequence in clone 11b, by using a clone which overlaps clone 11b at the s'-tenminus, clone He. The 
sequence of clone 11b is shown in Rg. 23. Positive_dones were isolated with a frequency of 1 in 500,000. 
One isolated clone. k9-1. was subjected to further study. The overlapping nature of the HCV cDNA in clone 
k9-1. to the s'-end of the HCV-cDNA sequence in Rg. 26 was confirmed by probing the clone with clone 
Alex 46: this latter clone contains an HCV cDNA sequence of 30 base pairs which coaesponds to those 
base pairs at the s'temninus of the HCV cDNA in clone 14i. described supra.. 

The nucleotide sequence of the HCV cDNA isolated from clone k9-1 was determined using the 
techniques described supra. The sequence of the HCV cDNA in clone k9-1 , the overlap with the HCV cDNA 
in Rg. 26, and the amino acids encoded therein are shown in Rg. 46. 

The HCV cDNA sequence in clone k9-1 has been aligned with those of the clones described in Section 
IV.A.19. to create a composite HCV cDNA sequence, with the k9-1 sequence being placed upstream of the 
sequence shovyn in Rg. 32. The composite HCV cDNA which includes the k9-1 sequence and the amino 
acids encoded therein Is shown in Rg. 47. 

The sequence of the amino acids encoded in the s'-region of HCV cDNA shown in Rg. 47 has been 
compared with the corresponding region of one of the strains of Dengue virus, described supra., with 
respect to the profile of regions of hydrophobicity and hydrophilicity. This comparison showed that the 
polypeptides from HCV and Dengue encoded in this region, which corresponds to Uie region encoding NS1 
(or a portion thereof), have a similar hydrophobic/hydrophilic profile. 

The information provided infra, allows the identification of HCV strains. The isolation and characteriza- 
tion of other HCV strains may be accomplished by isolating the nucleic acids from body components which 
contain viral particies. creating cDNA libraries using polynucleotide probes based on the HCV cDNA probes 
described infra., screening the libraries for clones containing HCV cDNA sequences described infra., and 
comparing the HCV cDNAs from the new isolates with the cDNAs described infra. The polypeptides 
encoded therein, or In the viral genome, may be monitored for immunological cross-reactivity utilising the 
polypeptides and antibodies described supra. Strains which fit within the parameters of HCV, as described 
in the Definitions section, supra., are readily identifiable. Other methods for identifying HCv strains will be 
obvious to those of skill in the art, based upon the infonnation provided herein. 



Industrial Applicability 



The invention, in the various manifestations disclosed herein, has many industrial uses, some of which 
are the following. The HCV cDNAs may be used for tiie design of probes for the detection of HCV nucleic 
acids in samples. The probes derived from the cDNAs may be used to detect HCV nucleic acids in. for 
example, chemical synthetic reactions. They may also be used in screening programs for anti-viral agents, 
to determine the effect of the agents in Inhibiting viral replication in cell culture systems, and animal model 
systems. The HCV polynucleotide probes are also useful In detecting viral nucleic acids in humans, and 
thus, may serve as a basis for diagnosis of HCV infections in humans. 

In addition to the above, the cDNAs provided herein provide information and a means for syntiiesizing 
polypeptides containing epitopes of HCV. These polypeptides are useful in detecting antibodies to HCV 
antigens. A series of immunoassays for HCV infection, based on recombinant polypeptides containing HCV 
epitopes are described herein, and will find commercial use in diagnosing HCV induced NANBH, in 
screening blood bank donors for HGV-caused infectious hepatitis, and also for detecting contaminated blood 
from infectious blood donors. The viral antigens will also have utility in monitoring the efficacy of anti-viral 
agents in animal model systems. In addition, tiie polypeptides derived from the HCV cDNAs disclosed 
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herein will have utility as vaccines for treatment of HCV infections. 

The polypeptides derived from the HCV cDNAs. besides the above stated uses, are also useful for 
raising anti-HCV antibodies. Thus, they may be used in anti-HCV vaccines. However^ the antibodies 
produced as a result of immunization with the HCV polypeptides are also useful in detecting the presence 

5 of viral antigens in samples. Thus, they may be used to assay the production of HCV polypeptides in 
chemical systems. The anti-HCV antibodies may also be used to monitor the efficacy of anti-viral agents in 
screening programs where these agents are tested in tissue culture systems. They may also be used for 
passive immunotherapy, and to diagnose HCV caused NANBH by allowing the detection of viral antigen(s) 
in both blood donors and recipients. Another important use for anti-HCV antibodies is in affinity chromatog- 

10 raphy for the purification of virus and viral polypeptides. The purified virus and viral polypeptide prepara- 
tions may be . used in vaccines. However, the purified virus may also be useful for the development of cell 
culture systems in which HCV replicates. 

Cell culture systems containing HCV infected cells will have many uses. They can be used for the 
relatively large scale production of HCV, which is normally a low titer virus. These systems will also be 

75 useful for an elucidation of the molecular biology of the virus, and lead to the development of anti-viral 
agents. The cell culture systems will also be useful in screening for the efficacy of antiviral agents. In 
addition, HCV permissive cell culture systems are useful for the production of attenuated strains of HCV. 

For convenience, the anti-HCV antibodies and HCV polypeptides, whether natural or recombinant, may 
be packaged Into kits. 

20 The method used for isolating HCV cDNA, which is comprised of preapring a cDNA library derived from 
Infected tissue of an individual, in an expression vector, and selecting clones which produce the expression 
products which react immunologically with antibodies In antibody-containing body components from other 
infected Individuals and not from non-infected individuals, may also be applicable to the isolation of cDNAs 
derived from other heretofore uncharacterized disease-associated agents which are comprised of a genomic 

25 component. This, in turn, could lead to isolation and characterization of these agents, and to diagnostic 
reagents and vaccines for these other disease-associated agents. 



Claims 

'30 

1 . A purified HCV polynucleotide. 

2. A recombinant HCV polynucleotide. 

3. A recombinant polynucleotide comprising a sequence derived from an HCV genome or from HCV 
cDNA. 

35 4. A recombinant polynucleotide encoding an epitope of HCV. 

5. A recombinant vector containing the polynucleotide of claim 2, or claim 3, or claim 4. 

6. A host cell transformed with the vector of claim 5. 

7. A recombinant expression system comprising an open reading frame (ORF) of DNA derived from an 
HCV genome or from HCV cDNA, wherein the ORF Is operably linked to a control sequence compatible 

40 with a desired host. 

8. A cell transformed with the recombinant expression system of claim 7. 

9. A polypeptide produced by the cell of claim 8. 

10. Purified HCV. 

1 1 . A preparation of polypeptides from tine HCV of claim 10. 
45 12. A purified HCV polypeptide. 

13. A purified polypeptide comprising an epitope which is immunologically identifiable with an epitope 
contained in HCV. 

14. A recombinant HCV polypeptide. 

15. A recombinant polypeptide comprised of a sequence derived from an HCV genome or from HCV 
50 cDNA. 

16. A recombinant polypeptide comprised of an HCV epitope. 

17. A fusion polypeptide comprised of an HCV polypeptide. 
ia A monoclonal antibody directed against an HCV epitope. 

19. A purified preparation of polyclonal antibodies directed against HCV. 
65 20. A particle which is immunogenic against HCV infection comprising a non-HCV polypeptide having 
an amino acid sequence capable of forming a particle when said sequence is produced in a eukaryotic 
host, and an HCV epitope. 

21. A polynucleotide probe for HCV. 
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22. A kit for analyzing samples for the presence of polynucleotides derived from HCV comprising a 
poiynucleotide probe containing a nucleotide sequence from HCV of about 8 or more nucleotideSr in a 
suitable container. 

23. A kit for analyzing samples for the presence of an HCV antigen comprising an antibody directed 
5 against the HCV antigen to be detected, in a suitable container. 

24. A kit for analyzing samples for the presence of an antibodies directed against an HCV antigen 
comprising a polypeptide containing an HCV epitope present in the HCV antigen, in a suitable container. 

25. A polypeptide comprised of an HCV epitope, attached to a solid substrate, 

26. An antibody to an HCV epitope, attached to a solid substrate. 

w 27. A method for producing a polypeptide containing an HCV epitope comprising incubating host cells 
transformed with an expression vector containing a sequence encoding a polypeptide containing an HCV 
epitope under conditions which allow expression of said polypeptide. 

28. A polypeptide containing an HCV epitope produced by the method of claim 27, 

29. A method for detecting HCV nucieic acids in a sample comprising: 

75 (a) reacting nucleic acids of the sample with a probe for an HCV polynucleotide under conditions 

which allow the formation of a polynucleotide duplex between the probe and the HCV nucleic acid from the 
sample; and 

(b) detecting a polynucleotide duplex which contains the probe. 

20 30. An immunoassay for detecting an HCV antigen comprising; 

(a) incubating a sample suspected of containing an HCV antigen with a probe antibody directed 
against the HCV antigen to be detected under conditions which allow tine formation of an antigen-antibody 
complex; and 

(b) detecting an antigen-antibody complex containing the probe antibody. 

25 

31. An immunoassay for detecting antibodies directed against an HCV antigen comprising: 

(a) incubating a sample suspected of containing anti-HCV antibodies with a probe polypeptide which 
contains an epitope of the HCV, under conditions which allow the formation of an antibody-antigen complex; 
and 

30 (b) detecting the antibody-antigen complex containing the probe antigen. 

32. A vaccine for treatment of HCV infection comprising an immunogenic polypeptide containing an 
HCV epitope wherein the immunogenic polypeptide is present in a pharmacologically effective dose in a 
pharmaceutically acceptable excipient 

35 33. A vaccine for treatment of HCV infection comprising inactivated HCV in a pharmacologically 
effective dose in a pharmaceutically acceptable excipient. 

34. A vaccine for treatment of HCV infection comprising attenuated HCV in a pharmacologically 
effective dose in a pharmaceutically acceptable excipient. 

35. A tissue culture grown cell infected with HCV. 

40 36. The HCV infected cell of claim 35, wherein the cell is of a human macrophage cell line, or is of a 
hepatocyte cell line, or is of a mosquito cell line, or is of a tick cell line, or is of a mouse macrophage cell 
line, or is an embryonic cell. 
*^ 37. The HCV infected cell of claim 35, wherein tine cell is of a cell line derived from liver of an HCV 
infected individual. 

45 38. A metiiod for producing antibodies to HCV comprising administering to an individual an isolated 
immunogenic polypeptide containing an HCV epitope in an amount sufficient to produce an immune 
response. 

39. A method for producing antibodies to HCV comprising administering to an individual the polypeptide 
preparation of claim 11. wherein the preparation contains at least 1 immunogenic polypeptide, and the 

50 administering is of an amount sufficient to produce an immune response. 

40. A method for isolating cDNA derived from the genome of an unidentified infectious agent 
comprising: 

(a) providing host cells transformed with expression vectors containing a cDNA library prepared from 
nucieic acids isolated from tissue infected with the agent and growing said host cells under conditions 

55 which allow expression of polypeptide(s) encoded in the cDNA; 

(b) interacting tiie expression products of the cDNA with an antibody containing body component of 
an individual infected with said infectious agent under conditions which allow an immunoreaction. and 
detecting antibody-antigen complexes fonmed as a result of the interacting; 
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(c) growing host cells which express polypeptides that form antibody-antigen complexes in step (b) 
under conditions which allow their growth as individual clones and isolating said clones;- 

(d) growing cells from the clones of (c) under conditions which allow expression of polypeptide(s) 
encoded within the cDNA, and interacting the expression products with antibody containing body compo- 

5 nents of individuals other than the individual in step (a) who are infected with the infectious agent and with 
control individuals uninfected with the agent, and detecting antibody-antigen complexes formed as a result 
of the interacting; 

(e) growing host cells which express polypeptides that form antibody-antigen complexes with 
antibody containing body components of infected individuals and individuals suspected of being infected, 

10 and not with said components of control Individuals, under conditions which allow their growth as individual 
cloneis and isolating said clones; and 

(f) isolating the cDNA from the host cell clones of (e). 
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FIG. I Translation of DNA 5-1-1 

AlaSerCysLeuAsnCysSerAlaSerllelleProAspArgGluValLeuTyrArgGlu 
1 GGCCTCCTGCTTCAACTGCTCGGCGAGCATCATACCTGAG2W5GGAAGTCCTCaA 

CCGGAGGACGAACTTGACGAGCCGCTCGTAGIIATGGACTGTCCCTTCAGGAGATGGCTCT 

PheAspGluMetGluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMet^ 
61 GTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCT 
CAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTMCTCGTTCCCTA^^ 

AlaGluGlnPheLysGlnLysAlaLeuGlyLeu 
121 CGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCC 
GCGGCTCGTCAAGTTCGTCTTCCGG6AGCCGGAGG 



FIG- 3 Translation of DNA 5-1-1, 81, 91&1-2 

GlyCysValVallleValGlyArgValValLeuSerGlyLysProAlallelleProAsp 
1 CTGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTG 
GACCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGAC 

T 

ArgGluValLeuTjprArgGluPheAspGlxiMetGluGluCysSerGlnHisLeuProTyr 
61 ACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGT 
TGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCA 
A 

IleGluGlnGlyHetMetLeuALaGluGlnPheLysGlnLysAlaLeuGlyLeiiLeuGln 
121 ACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGC 
TGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACG 

ThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLysLeu 
181 AGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAAC 
TCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTG 

GluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeiiAlaGly 
241 TCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGG 
AGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCC 

* 

LeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAlaVal 
301 GCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTG 
CGAACAGTTGCX3ACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGACGAC 

ThrSerProLeuThrThrSerGln 
361 TCACCAGCCCACTAACCACTAGCCAAA 
AGTGGTCGGGTGATTGGTGATCGGTTT 



EP 0 318 216 A1 





EP 0 318 216 A1 



FIG. 4 Translation of DNA 81 

SerGlyLysProAlallelleProAspArgGluValLeuTyTArgGluPheAspGliMet 
1 GTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGAT 
CAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGGTCTCAAGCTACTCT 

GluGluCysSerGlxiHisLeuProTjrrlleGluGlnGlyMetMetLeuAlaGluGlnPhe 
6 1 GGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAA6GGATGATGCTCGCCG AGCAGTT 
CCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAA 

LysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGliiAlaGlu^ 
121 CAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCC 
GTTC6TCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGG 

AlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhe 
181 TGCT6TCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATC 

ACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAA 

IleSerGlylleGlnTyrLeioAlaGlyLeuSerThrLeuProGlyAsnProAlalleAla 
241 CATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGC 
GTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACG 

SerLeuMetAlaPheThrAlaAlaVeaThrSerProLeuThrThrSerGln 
3 01 TTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTT 



FIG. 5 Translation of DNA 36 

AspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrlieuValAla 
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGA2VAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrp 
6 1 TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAG ATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACC 

LysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArglieu 
121 AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTC^ 

GlyAlaValGlnAsnGluIleThrLeuThr His ProVa IThrLy sTy r IleMetThrCy s 
181 GGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGC 
CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAla 
24 1 ATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGGTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 



Alal/euAlaAlaTyrCysLeixSerThrGlyCysValVallleValGlyArgValVallieu 
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

Overlap with 81 

SearGlyliysProAlallelleProAspArgGluValLeuTyrArg 
361 TCCGGGAAGCCGGCAATCATACCTG ACAGGGAAGTCCTCTACCGAG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTC 
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FIG. 6 Combined ORF of DNAs 36 & 81 



AspAlaHisPheLeuSerGlnThrLysGlnSerGLyGluAsnLeuProTyrLeuValAia 
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAAT^ 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrp 
6 1 TACC?UVGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAG A 

ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTAC^ 

IiysCysLeuIleArgljeuLysProThrLeuHisGLyProThrProLeuLeuTyrArgLeu 
121 AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTG 
TTCACAAAGTAAGCGGAGTTCGGGOXSGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCys 
181 GGCGCTGOTCAGAATG2>JiATCACCCTGACGCACCCAGTCACCAAATAC^ 

CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAla 
241 ATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCT 
TACAGCaSGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCMCCGCCGCAGGACC^ 

AlaLeiiAlaMaTyrCysI^uSerThrGlyCysValVallleValGlyArgValValLeu 
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTC 
CGAAACOSGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

SerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMet 
361 TCCGGG AAGCCGGCAATCATACCTG ACAGGGAAGTCCTCTACCG AG AGTTCG ATGAGATG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTAC 

GluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPhe 
421 GAAGAGTGCTCTCAGCACOTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGC^ 

CTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTC^ 

LysGlnLysAlal^uGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaPro 
481 AAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCT 
TTCGTCTTCCm;AGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATA^ 

AlaValGlnThrAsaTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsriPhe 
541 GCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATCTC 
CGACAGGTCTGGTTGACCGTTTTTGAGCTC^ 

IleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAla IleAla 
601 ATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCC^ 

TAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATO^^ 

SerLeuMetJaaPheThrAlaAlaValThrSe^ProLeuThrThrSerGln 
661 TCATTGATGGCTTTTACAGCTGCTGTOVCC^^ 

AGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGT^ 




FIG. 7 Translation of DNA 32 



Overlap with 81 

PheThrftlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeu 
1 CTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATAT 
GAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATA 

GlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAla 
61 TGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCG 
ACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGC 

GlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeu 
121 CTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCC 
GACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGG 

AlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGlu 
181 TTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTG 
AACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCAC 

ValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeu 
241 AGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCC 
TCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGG 

ValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAla 
301 TCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGG 
AGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCC 

ValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
361 CAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
GTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 
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FIG. 8 Translation of DNA 35 

SerlleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg 
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGG 
AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

ThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGly 
61 ACTGGCAGGGGGAAGCGAGGCATCTACAGATTOXSTCGCACCGGGGGAGCGCCCCTCCGGC 
TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG 

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeu 
121 ATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTCCTTGGTATGAGCTC 
TACAAGCTGAGCAGGCAGGAGACACTCACGAITACTGCGTCCGACACGAACCATACTC^ 

ThrProAlaGluThrOlnrValArglieiiArgAlaTyrMetAsnThrProGlyl^ 
181 ACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTG 
TGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGyGGGGCCCCGAAGGGCAC 



CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
24 1 TGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCT^ 

ACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 



HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln 
301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAA 
GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATC^ 

Overlap with 36 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTG^ 

CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 



LeuIleArgLetLLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 
421 TTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGC^^ 
AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 
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FIG. 9"" I Combined ORF of DNAs 35,36,81 & 32 

SerlleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg 
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGG 
AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

ThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGly 
61 ACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGC 
TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG 

MetPheAspSerSerValLeuCysGluCysTyJ^AspAlaGlyCysAlaTrpTyrGluLeu 
121 ATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTC 
TACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
181 ACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTG 
TGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCAC 

CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
241 TGCCAGGACCATCTTGAATTttGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCC 
ACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeiiProTyrLeuValAlaTyrGln 
301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAA 
GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGT^ 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGT 
CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 
421 TTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCT 
AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 

ValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleKetThrCysMetSer 
481 GTTCAG AATG AAATCACCCTG ACGCACCCAGTCACCAAATACATCATGACAT^^ 
CAAGTCTTACTTTAGTGGGACDSCGTGGGTCAGTGGTTTATGTAGTACTGT^^ 

AlaAspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeu 
541 GCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTCCTT 

CGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAAC 

AlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeuSerGly 
601 GCCGCGTATTGCCTGTCAACAGGCTGCGTGGTOVTAGTGGGCAGGGTCGTCTTC^ 
CGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCa^GCAGAA^ 

LysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGlu 
661 AAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAG 
TTCGGCCGTTAGTATGGACTGTCCCTtCAGGAGATGGCTCTOCM 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGln 
7 21 TGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTC^ 
ACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTC^ 

LysAlaLeuGlyLeuLeuGlrxThrAlaSerArgGlnAlaGluVallleAlaProAlaVal 
7 81 AAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGT^ 
TTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGTO 



EP 0 318 216 A1 ' :* • ' 



1 * 



1081 



-1 ^i-JJ?!}? *^"^5P^l"^ys^'^GluThrPheTrpAlaLysHisMetTrpAsnPherieSer 
841 CAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGT 
GTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCA 

S^y^^®^^°^y^^®"AlaGlyLeuSerThrLeuPrbGlyAsnProAlaIleAlaSerLeu 
901 GGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTG 
CCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTCGGGCGGTAACGAAGTAAC 

MetAlaPheThrAlaAlaVaiThrSerProLeuThrThrSerGlnThrLeuLeuPheAsn 
961 ATGGCTTTTACAGCTGCTGTCACCAGCCd^CTAACCACTAGCCAAACCCTCCTCTT^ 
TACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTG 

lleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
1021 ATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTG 
TATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACAC 

GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsD 
GGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGAC 
CCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTCACCCCTTCCAGGAGTATCTG 

n >, , f i^^AlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLy s IleMetSer 
114 1 ATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGC 
TAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCG 

v^«, ^ly^^^ValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlv 
1201 GGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGA 
CCACTCCAGGGGAGGTGCCTCCiX^GACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCT 

T "V^l"^*lGlyValValCysAlaAlaIleLeuArgArgHisValGlyProGlyGlu 
^2^^ GCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTC^^ 

CGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTC 

, SlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
^^^^ ^GGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGG^^ 

CCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGCTACAA^ 

FIG. 9-2 



EP 0 318 216 A1 




FIG. IQ Translation of DNA 37b 

GAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTG 

X^Si2S^I^-'--^®^^°^^^^ly^PValValValValAlaThrAspAlaLeuMetThr 
^^^^^^S^^^^^^^^^^^GCSATGTTGTCGTCGTGGCAACCGATGCCSS^^ 
CACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGG 

GlyTyrThrGlyAspPheAspSerVallleAspTyrAsnThrCysValThrGlnThrVal 

S^ST^^^^^^^^^'^^actcggtgatagactacaatacgtgtgtcacccagaS^ 

CCGATATGGCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAG 



AspPheSerLeuAspProThrPheThrlleGluThrlleThrl^uProSSAsDAlaVal 
CTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAG 



clone 35 

SerArgThrGlnArgArgGlyArgThr 

TCCCGCACTCAACGTCGGGGCAGGACTG 

AGGGCGTGAGTTGCAGCCCCGTCCTGAC 



FIG. II Translation of DNA 33b 
-Overlap with 32- 



CTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTAC^ 

CG(KCTCTCCCT*CGTCGAOGG(KSC&SIGaCGGTATGMTCGICGGA^^ 

CGflGGACICCGCIGACGI(5GTCSCCMTCGAGCCT(aCATCCTGAGCTAra 
GACa»TTCCCTGTMACCCKaiCCTATACGCTCCAC«lCTCS?^ 

TTITCMTTCGKTAC(KIGTCMCGGACCCTAG<S3GAAACi5^ 

LysGlyValTrpArgVal 
TAAGGGGGTCTGGCGAGTG 
ATTCCCCCAGACCGCTCAC 
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FIG. I 2 Translation of DNA 46b 

AlaT^rMetSerLysMaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
1 GGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAAT 
CCGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAl^ 
61 TACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC 

SerGlyGiyAlaTyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

IleLeuGlylleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal 
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaThrAlaThrProProGlySerValThrValProHisProAsnlleGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 
301 TGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA. 



LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaiaa 
361 CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAG AAGAAGTGCG ACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

: : ' Overlap with 37b 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
421 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 



IleProThr 
481 CATCCCGACCAG 
GTAGGGCTGGTC 
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FIG. 13 Translation of DNA 25c 



CysSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrplleSerSerGluCvs 
1 ACTGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGT 
TGACGTOSGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCA 



ThrThrProCysSerGlySerTrpLeiiArgAspIleTrpAspTrpIleCysGluValLeu 
61 GTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGT 
CATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACA 



-Overlap with 33b- 



SerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlylleProPhe 
121 TGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCT 
ACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGA 



ValSerCysGlnArgGlyTyrLysGlyValTrpArgGlyAspGlylleMetHisThrArq 
181 TTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGGGGACGGCATCATGCACACTC 
AACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCCCCTGCCGTAGTACGTGTGAG 

CysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArglleValGly 
241 GCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCG 
CGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGC 

ProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGly 
301 GTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGG 
CAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCC 

« Pi^oCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGlu 
361 GCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAG 
CGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTC 

^^S^i^^^X^lGl^IleArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAsp 
421 AGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTG 
TCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGAC 

AsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
481 ACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
TGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 
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FIG. I 4- 1 Combined ORF of DNAs 40b/37b/35/36/81/32/33b/25c 

AlaTyrHetSerLysAlaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
1 TGGTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAAC^ 
ACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProILeThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
61 TACCACTGGCAGCCCCATCACGTACTCCACCXACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCC^^ 

SerGlyGlyAlaXyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGOXSTAG 

IleLeuGlylleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal 
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaXhrAlaThrProProGlySerValThrValProHisProAsnlleGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLy sAla IleProLeuGluVal I le 
301 TCCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 

LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
361 CAAGGGGGGG AGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCG ACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
421 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCGACCAGCGTAACCCGXAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 

IleProThrSerGlyAspValValValValAlaThrAspAlaLeuMetThrGlyTyrThr 
481 CATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATAC 
GTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATG 

GlyAspPheAspSerVallleAspTyrAsnThrCysValThrGlnThrValAspPheSer 
54 1 CGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTCGATTTCAG 
GCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAGCTAAAGTC 

LeiiAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
601 CCTaX3ACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCAC 
GGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTG 

GlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGly 
661 TCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGG 
AGTTGCAGeCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCC 

GluArgProSerGlyMetPheAspSerSerVedLeuCysGluCysTyrAspAlaGlyCys 
721 GGAGCGCCCCTCCGGCATGTOXZGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTG 
CCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGAC 

AlaTrpTyrGluLeuThrProAlaGluThrThrValArgLexiArgAlaTyrMet^ 
781 TGCTTGGTATGAGCTCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACAC 
ACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTC 

ProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeu 
841 CCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCT 
GGGCCCCGAAGGGCACACGGTCCTGGTAGAACXTAAAACCCTCCCGCAGAAATGTCCGGA 

ThrHislleAspAlaHisPheLeiiSerGlnThrLysGlnSerGlyGluAsnLeuProTyr 
901 CACTCATATAGATGCCC^CTTTCrATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCT 

GTGAGTATATCTACGGGTGAAAGATAGGGTCTGTl?TCGTCTCACCCCTCTTGGAAGGAAt 
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1861 

FIG. 14 



LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 
961 CCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGA 
GGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGIAGC^^ 

GlnMetTrpLysCysLeuIleArgLeiiLysProThrLeimisGlyProThrProLeuIieu 
1021 CCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACC 
GGTCTACACCTTCACAAACTAAGCGGAGTTCGGGaX3GGAGGTACCCGGTTGTGGG 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
1081 ATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACAT 
TATGTCTGACCCGCGACAAGTCTTACTTTAGOXSGGACTGCGTGGGTCAGTGGTTTATGTA 

MetThrCysMetSerAlaAspLeuGluValValThrSerThrTrpVallieuValGlyGly 
1141 CATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGG 
GTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCC 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGl5rCysValValIleValGlyArg 
1201 CGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAG 
GCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTC 

ValValLeuSerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPhe 
1261 GGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTT 
CCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAA 

AspGluMetGluGluCysSerGlnHisLeuProTyrlleGIuGlnGlyMetMetLeuAla 
1321 CGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGC 
GCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCG 

GluGlnPheLysGlnLysAlalieuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal 
1381 CGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGT 
GCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCC^ 

IleAlaProAlaValGlnThrAsnTrpGlnliysLeuGluThrPheTrpAlaLysHisMet 
1441 TATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATAT 
ATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATA 

TrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
1501 GTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCC 
CACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCA^ 

AlalleAlaSerrieuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
1561 CGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCA 
GCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGT 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
1621 AACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGC 
TTGGGAGGAGAAGTTGTATAACCCCGCCACCCACCGACGGGTCGAGCGGCGGGGGCCACG 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 
1681 CGCTACTGCCTTTCTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACI^^ 
GCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCC 

LysValLeuIleAspIleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAla 
1741 GAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGC 
CTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGA^ 

PheLysIleMetSerGlyGluValProSerThrGluAspLeuValAsnl^uL^ 
1801 ATTCAAGATCATCAGCGGTCAGGTCCCCTCCACGGAGGACCTGGTCAATCT^^ 

TAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGAC^ 

IleLeuSerProGlyAlaIieuValValGlyValValCysAlaAlaIler,e\iArgArgHis 
CATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTCCAGCAATACTGCGCC^ 
GTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGT 



-2 



ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArg 
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1921 CGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCG 
GCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGC 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 
1981 GGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCAC 
CCCCTTGGTACAAAGGGGGTGCGTCATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTG^ 

AlalleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrp IleSer 
2041 TGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAG 
ACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTCACGTGGTCACCTATTC 

SerGluCysThrThrProCysSerGlySerTrpLeiiArgAsplleTrpAspTrpIleCys 
2101 CTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATG 
GAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATAC 

GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly 
2161 CGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGG 
GCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACC 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
2221 GATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCAT 
CTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTA 

HisThrArgCysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArg 
2281 GCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAG 
CGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTAGTC 

IleValGlyProArgThrCysArgAsnMetTrpSerGlyXhrPheProIleAsnAlaTyr 
2341 GATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTA 
CTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTAGGGAT 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
2401 CACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGT 
GTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCA 

SerAlaGluGluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMet 
2461 GTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTAT 
CAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATA 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
2521 GACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
CTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 

FIG. 14-3 
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FIG* 15 XranslatioDt of DNA 33c 

AlaValAspPhelleProValGlioAsiiLeuGluThrThrMetArgSerProValPheThr 
1 GGCGGTGGACirrrATCCCTGTGGAGAACCTAGAGACAACC^ 
CCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACT^ 

AspAsnSerSerProProValValProGlnSerPheGlnValAlaHlsLeuHisAlaPro 
61 GGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTC 

CCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTAC^^ 

ThrGlySerGlyLysSerOtoLysValProAlaAlaa^AlaAlaGlnGlyTyrLysVsUL 
121 CACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGC^ 
GTGTCCGTCGCCGTTTTCGTGGTTCCJ^GGGCCGACG 



LeuVall^iiAsnProSerVaaAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLysAla 
181 GCTAGTACTCAACCCCTCTGTTGCTGCAACAC!!^ 

CGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAA!!^ 

O verlap with 40 b 

HisGlylleAspProAsnlleArgThrGlyValArgThrlleThrTh^ 
241 TCATGGGATCGATCCa^CATCAGGACCGGGGTGAGAACAATO^ 
AGTACCCTAGCXftfiGAITGTAGTCCTGGCCCCACTCTTGTT^ 



ThrfTyrSerThrTyxGlyLysPheLetiAlaAspGlyGlyCysSerGlyGlyAlaTy^ 
301 CACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTC 

GTGCATGAGGTGGATGCeGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGA^ 



IlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThr 
361 CATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTC 

GTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGT^ 



Vall^iiAspGlnAlaGluThrAlaGlyAlaArgLeuValVcilI^uAlaThri^ 
4 21 TGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTC^ 

ACAGGAACTGGTTCGTCTCTGACGCCCCOSCXCTGACCAACACGA^ 



ProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThr^ 
481 TCCGGGCTCCGXCACTGTGCCCCAOXZCCAACATCGAGGAGGXTGCTCTGTCCACCA^^ 
AGGCCCGAGGCAGTCACaCGGGGTAGGGTTGTAGCTCCTCCAACGAGA^ 



GluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeu 
541 AGAGATCCCTTraXACGGCAAGGC^ 

TCTCOSIGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGT^ 



IlePheCysHlsSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGly 
601 CATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCT 
GTAGAAGACAGTaAGTTTCTTCTTCACGCTGCTTGAGC^ 



lleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAsp 
661 CAXCAATGCCGXGGCCTACTACCGCGGTCTTGACGTCTCCGTCATCCCGACCAGCG^^ 

GTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCT 

ValValValValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSerVal 
721 TGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGT 
ACAACAGCAGCACCGrrcGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGA^ 

IleAspCysAsnXhrCys 
781 G ATAGACTGCAATACGTGTG 
CTATCTGACGTTATGCACAC 
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FIG. 16 TranslatioQ of DNA 8h 

ProCysThrCysGlySerSeri^pLeuTTrLeuValThrArgHisAlaAspVal 
1 CTCCCTGCACTT6CGGCTCCnx:GGACCTTTACCTGGT^ 

GAGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCASTGC^ 

ValJ^ArgArgGlyAspSerArgGlySerl^uLeuSerProi^gProIleSer^^ 
61 CCGTGCGCCGGCGGGGTGATAGCAGG6GCAGCCTGCTGTCGCCCC 
GGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGG^ 

LysGlySerSerGlTGlyProLeuLeuCysProAlaGlyHislllaValGlyllePheArg 
121 TGAAMGCTCCTCGGGGGGTCCGCXXSTTGTGCCCCGCGGTC 
ACTTTCCGAGGAGCCCCCCaGGCGACAACAOSGGG^ 

Overlap with 

MaAlaValCysThrArgGlyValAlaLysAlaValAspPhelleProValGluAsnLeu 
181 GGGCCGCGGTGTGCACCCGTGGAGTGGCraAGGCGGTGGACTT^ 
CCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTG^ 

33c 

GluThrThrMetArgSerProVeilPheThrAspAsnSer 
241 TAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTC 
AtCTCTGTTGGTACTCCAGGGGCaVCAAGTGOT 



FIG. IT Translation of DMA 7e 

GlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeiil^u^^ 
1 GGGGTGGAGGTTGCTGGCGCCCaTCaiaSSCGlA^ 

CCCCACCTCC»ACGACCGGGGGTAGTGCCGCATGCGGGTCGTC 

CysIlelleThrSerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGlnlle 
61 GTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGG^ 
CACGOiATTAGTGGTCGGATTGACCGGCCCrcTTTTTG^ 

ValSerThrAlaAlaGlnllirPheLeuAlaThrCysIleAsnGlyValCysT^ 
121 TGTGTO^CTCCTGCCCAAACCTTCCTGGCAAC^ 

ACACAGTTGACGACGGGTTOXSGAAGGACCGTTGCAC^ 

TyrHisGlyAlaGlyThrArgThrlleAlaSerPrdLysGlyProVallleGlnMetS^ 
181 CTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCaAGGGTCX^ 
6ATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCC 



ThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeuTto 
241 laCCAATGTAGACCAAGACCTTGTGGGCTGGCCCMCTCC^^ 

ATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGACKCGl^^ 

Overlap with 8h . 

ProCysTlirCysGlySerSerAspLeuTyrLeuValThrArgHia 
301 ACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
TGGGACGTCAACGCCGAGGWKCTGGAAATGGACCAGTGCTCCGTGC 
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FIG. 18 Translation of DNA 14c 



AsnMetTrpSerGlTThrPheProIleAsnMaTyxThriairGlyProC^ 
1 ACATGTGGAGTGGGACCTTCCCCA!^^ 
CTTGTACACCTCACCCTGGAAGGGGTAAT^ 

Overlap with 25c — 

ProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyrValGluIle 
61 TCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAG^^ 
J«^CGCGGCTTGATGTGCAAGCGCGATACCTGCCA^ 



ArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeiiLysCysPro 
121 W^GGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACT^ 

TTCCGTCCACCCCCTCAAGGTGATGCACTGCCCATACTGATGACTCXl^^ 



C^-sGlnValProSerProGluPhePheThrGluLeuAspGlyValArgLeuHisArgPhe 
181 GTGCCAGGTCCCaTCGCCCGAATTTTTCACAGAATTGGACGGGGTC 

CACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAAGCTGCCCCa^GCGGA^ 

AlaProProCysLysProLeuLexiArgGluGluValSerPheArgValGlyLeiiHtsGlu 
241 TGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAG6AC 

ACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCAlAGTAAGTCTCaTCCTGA^ 

TyrProValGlySerGlnLeuProCysGluProGluProAspValAlaVailLeuThrSer 
301 ATACCCGGTAGGGTCGCAATTACCTOXSCGAGCCCGAACCGGACGTGGCCGIGTTGACGTC 
TAOXSGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTCCACCGG^ 

MetLeuThr AspProSerHis IleThrAlaGlxiAlaAlaGlyArgArgLeiiAlaArgGly 
361 CATGCTOUITGATCCCTCCCAT ATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGG^ 
GTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCC 

SerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAla 
421 ATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGC 
TAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGl^ 

ThrCysThrAlaAsnHisAspSerProAsp 
481 AACTTGCACCGCTAACCATGACTCCCCTGAT 
TOXSAACGTGGCGATTGGTACTGAGGGGACTA 
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FIG* 19 Translation of JDHA 8f 

■ Overlap with i4c 

SerSerSerAlaSerGlnLeuSerAlaProSerLeiiLysAlaThrt^ 
1 AGCTCCTCGGCTRGCCWKriaT 

TCGAGGAGCCGATCGGTCGAOaGGCGaGGTftGAGAGTTCCGTTGAA 



AspSerProAspAlaGluI^iiIleGliiAlaAsiaLeiiLeuTrpArgGlnGluM^ 
61 GACrcca3GATGCTGAGCTCAT2^(^^ 

CTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATAC^ 

• AsnlleThrArgValGliiSerGliiAsnLysValVallleLeiiAspSerPheA^Prc^ 
121 MCATCACCaGGGITGAGTCAGAAAAO^!^^ 
TTGTiAGTGGTCCCaACrCMTC3^^ 

ValJVlaGluGluAspGluArgGluIleSerValProzaaGlxiIlelieuAM 
181 GTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAAI^^ 
CACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTC^^ 

ArgPheAlaGlnMaLeuProValTrpAlaArgProAspTyrAsnProPrbLeuValGlu 
241 AGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGAC^^ 

TCTAAGQ3GGTCCGGGACGGGCAAACCCGCGCCGGCCTGAIATTG© 

ThrTrpLysLysProAspTyiGluProProValValHisGlyCysProLeiiProProPro 
301 ACGTGGAAAAAGCCCGACiaCGAACCACCTGTGGTCCATGG^ 
TGCACCTTTTTCGGGCTGAOXKITaXMTG^ 

LysSerProProValPro 
361 AAGTCCCCTCCTGTGCCG 
TTCAGGGGAGGACACGGC 



FIG. 20 Translation of JDNA 33f 



ValTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysPrcAspTyr 
1 CGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGlXa; 
GCAAACCCGCGCCGGCCTGAIATTGGGGGGCGATCACCTCTXSC^^ 

Ov^erlap with 8f — 

GluProProValValHisGlyCysPrOLeuProProProLysSerProProValProPro 
61 CGAACCACCTGTGGTCCATGGCTGCCCGCTTCCaCCTCCaAAGTCC 
GCTTGGTGGACACCACGTACCGACGGGCGAAGGTGGAGGTTTCAGG® 

ProArgLysLysArgOftrValVall^uThrGlxxSerThrrieiiSerThr^ 
121 GCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCaACCCTATCTACl^ 
CGGAGCCTTCTTCGCCTGCCACCAGGAGTGACT^ 

LeuAlaThrArgSerPheGlySerSerSerThrSerGlylleThrG 
181 GCTCGCCACCMAAGCTTTGGCAGCTCCTCAACTTCCGGCAl^ 

CGA6C6GTGGTCTTCGAAACCGTCGAG6AGTTGAAGGCCGXAAIGCCCGCTGTTATG 

ThrSerSerGliiProAlaProSerGlyCysProProAspSerAspAlaGl^ 
24 1 AA(a.TCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACrc 

TTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCACGA^ 
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FIG. 22 Translation of dna 7f 
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FIG. 23 Translation of DNA lib 

GlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLysAr 
1 GGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTAT 
CCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATA 

IleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHis 
61 ATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAAC^ 
TAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTC^ 

ValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeuMeiiCys 
121 GTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGT 
CACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACA 

AlaValHisProThrLeuValPheAspIleThrLysLeiiLeuLeuAlaValPheGlyPro 
181 GCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCC 
CGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGG 

LeuTrpIleLeuGlnAlaSerLeuLeuLysValProTyrPheValArgValGlnGlyLeu * 
241 CTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCC 

GAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAA 

LetiArgPheCysAlaLeuAlaArgLysMetlleGlyGlyHisTyrValGlnMetVallle 
301 CTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATC 
GAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAG 



IleLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeuArgAsp 
361 ATTAAGTTAGGGGCGCTTACT6GCACCTATGTTTATAACCATCTCACTCCTCTTCGGGAC 
TAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCOX; 

Overlap with 7f 

TrpAlaHisAsnGlyLeiiArgAspLeuAiaValAlaValGluProValValPheSerGln 
421 TGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCC 

ACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTT 



MetGluThrLysLeuIleThrTrpGly 
481 ATGGAGACCAAGCTCATCACGTGGGGGGC 
TACCTCTGGTTCGAGTAGTGCACCCCCCG 
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FIG. 24 Translation of DNA 14i 

GluTyrValVall^uLeuPheLeuLeuLeiiAlaAspAlaArgValCysSerCysLeuTrp 
1 GGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsrO^uVallleLeuAsnAla 
6 1 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGqCGAAACCTCTTGGAGCATTATCAATTA^ 

AlaSerlieuAlaGlyThrHisGlyLeuValSerPheLeuValPhePheCysPheAlaTrp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGITVGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGA^ 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProI^ 
18 1 GGOlATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCaiACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCT^ 

LexiLetilieiiLexiLetiAlaLetiProGlnArgAlaTyrAlal^uAspT^^ 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 



-Overlap with lib 



SerCysGlyGlyValVallieuValGlylieuMetAlaLeuThrLeuSerProTyrTyrLys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTCATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTAl^ 



ArgTyrlleSerTrpCysLeuTrpTrpIieuGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGAA 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCTT 

FIG. 25 Translation of DNA 39c 



ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
1 CCAGCCCCTTCaK5GCTGCCCCCCCGACTCCGACGCTGAGTCCTAa?TCCTCCATGCCCCCC 
GGTCGGGGAAGACCGACGGGGGGGCTGAGGCTCCGACTCAGGATAAGGAGGTACGGGGGG 

LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer 
6 1 CTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACAGTCAGTAGT 
GACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGTCAGTCATCA 



-Overlap with 33g 



GluAlaAsnAlaGliiAspVcilValCysCysSerMetSerTyrSerTrpThrGlyAlalieu 
121 GAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCCTACTCTTGGACAGGCGCAC^ 
CTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGGATCAGAACCTGTCCGCGT^ 

ValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlcLLeuSerAsnSerLeu 
181 GTCACCCCGTGCGCCGCGGAAGAACAGAAACTCCCCATCAATGCACOXSAGCAACTCGTTC 
aVGTGGGGCACGCGGCGCCTTCTOXSTCTOTGACGGGTAGTTACGTGACTC^ 



LeiiArgHisHisAsiiLeuVcLlTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys 
241 CTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGC^^ 

GATGCAGTGGXGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTC 

LysValThrPheAspArgLeuGlnValLeiiAspSerHisTyrGlnAspValLeuLysGlu 
301 AAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAG 
TTTCAGTGTAAACTGTCOX^ACGTTCAAGAGCTGTCGGTAATGGTCCTC 

ValLysAlaAlaAlaSerLysValLysAlaAsnPhe 
361 GTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTC 
CAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAAG 
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FIG. 26-1 COMBINED ORP OF DNAs 
14iAlb/7f/7e/8h/33c/40b/37b/35/36/81/32/33b/25c/14c/8f/33f/33g/39c 

GluTyrValValLeuLeuPheLeuLeiiLeiiAlaAspAlaArgValCysSerCysLeuT^ 
1 GGGAGTACGTCGTTCTCCOXSTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACG^^ 

MetMetJjeiil^eulleSerGliiAlaGltiAlaAlaLeuGluAsnLeuVallle^ 
61 GGATGAIXXITACTCATATCCGAAGCGGAGGCGGCTTTGG 

CCTACTACGAOXSAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

AlaSerLeiiAlaGlyThrHisGlyLeuValSerPheLeuValPhePheCysPh 
121 CAGCMCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGC^ 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCAC?^GAAGACGAAACGT^ 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCaSGAGCGGTCGEACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGi^^ 

LeuLeuLeuLeuLexiAlaLeuProGlnArgAlaTyrAlaLeuAsp^ 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTG 

AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCAT6CGCGACCTGTGCCTCCACCGGC 

SerCysGlyGlyValVall^uValGlyLeiiMetAlal^uThrLeuSerProTyr^ 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTCACTCTGTCACCATA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGG 

ArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGliiAlaG^ 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCOXSACCA^ 

TCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTC^ 

LeuHisValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVaJ.IleLeuLeu 
421 AACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTT^ 
TTGAOSTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATC 

MetCysAlaValHisProThrLeuVaiPheAspIleThrLysLeuLeuLeuAlaValPhe 
481 TCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGC^ 
AGOACACACGACATGTGGGCTGAGACCATAAACTCTAGTGGTTTAA^ 

GlyProLeuTrpIleLeuGlnAlaSerLeuLetiLysValProTyrPheVeJJV^^ 
541 TCGGACCCCTTTCGATTCTTCAAGCCAGOT 

AGCCOXSGGGAAACCTA^A^GTTCGGTCAAACGAATTTCATCGGATGAAAC^^ 

Glyl^iiI^uArgPheCysAlaLeiiAlaArgLysMetlleGlyGIyHisTyrValGlc^ 
601 AAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT^^ 
^ TTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTC 

ValllelleLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeu 
661 TCGTCATCATTAAGTOaGGGGCGCTTACTGGCACCTATGTTTATAACC^ 

ACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAG 

ArgAspTrpAlaHisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPhe 
721 TTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTCGCTGTAGAGCCM 
AAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATC^ 

SerGlnMetGluThrLysLeuIleThrTrpGlyAlaAspThrAlaAlaCysGlyAspIle 
781 TCTCCCAAATGGAGACCAAGCTCAax:ACGTGGGGGGCAGATACCGCCGCGTGCGGTGA<^ 
AGAGGGTOrilACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATOT 

IleAsnGlyLeuProValSerAlaArgArgGlyArgGluIleLeiilieuGlyProAlaAsp 
841 TCATCAACGGCTTCCCTCTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCCAGCra 
AGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGC 

Glj^etValSerLysGlyTrpArgLeuLeuAlaProIleThrAlaTyrAIaGlnGlnThr 
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901 ATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGA 
TACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCT 

ArgGlyl^iiLeuGlyCysIlelleThrSerLeuThrGlyArgAspLysAsnGlnValGlu 
961 CAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGOX^ 
GTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACC 

GlyGluValGlnlleValSerThrAlaAlaGlnThrPheLeiiAlaThrCysIleAsnGly 
1021 AGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCA 

TCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTCGAAGGACCGTTGCACGTAG 

ValCysTrpThrValTyrHisGlyAlaGlyThrArgThrlleAlaSerProLysGlyPro 
1081 GGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTC 
CCCACACGACCTGACAGATCGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCC^^ 

VcilIleGliiMetTyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProGl^ 
114 1 CTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAG 
GACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCG^ 

SerArgSerLeua3irPrcDC5'sThrCysGlySerSerAspLeuTyrLeuVcil 
120 1 GTAGCCGCTCAlTOACACCCTGCACTTGCGGCTCCTCGGACCTTTACCaX^GTCACG 
CATCGGCGAGTAACTGTCGGACGTCAACGCCGAGGAGCCTGGAAATGGACC^ 

AlaAspVallleProValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArg 
126 1 ACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCC 
OXSCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGG 

ProIleSerTyrLeiiLysGlySerSerGlyGlyProLeulfiuCysProAlaGlyHisAla 
1321 GGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACG 
CCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGC 

ValGlyllePlxeArgAlaAlaValCysThrArgGlyValAlaLysAlaValAspPhelle 
1381 CCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTA 
GGCACCCGTATAAATCCCGGCGCCACAC6TGGGCACCTCACCGATTCCGCCACCTGAAAT 

ProValGliiAsnLeuGluThrThmetArgSerProValPheThrAspAsnSerStt 
1441 TCCCTGTGGAGAACCiawaGACAACCAaXSAGGTCCCCGGTGTTCACGGAa^ 

AGGGACACCXCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAG 

ProValValProGlnSerPheGlnValAlaHisLeiiHisAlaProThrGlySerGlyLys 
1501 CACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCA 
GTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCG 

SerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysValLeuValLexiAsnPro 
1561 AAAGCACCAAGGTCCCGGCTCCAa!ATGCAGCTCAGGGCTATAAGGTGCTAGa!ACTC^ 
TTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATC^ 

SerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLysAlaHisGlylleAspPro 
1621 CCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCC^ 

GGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAG 

AsnlleArgThrGlyValArgThrlleThrThrGlySerProIleThrTyrSerO^^^ 
1681 CTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCA^ 
GATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATG^ 

GlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspIlellelleCysAsp 
1741 ACGGCAAGTTCCTTGCCGACGGCGGGTGCXCGGGGGGCGCTTATGACATAATAATTTGTC 
TGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTAraAAACAC 

GluCysHisSerThrAspAlaltoSerlleLeuGlylleGlyThrValLeuAspGln^ 
1801 ACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGACCAAG 
TGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTCACAGGAACTGGTTC 

GluThrAlaGlyAlciArgLeuValValLeiiAlaThrAlaThrProProGlySerValThr 
1861 CAGAGACTGCGGGGGCGAGACTGGTTGTCCTCGCCACCGCCACCCCTCCGGGCTCCGTCA 
GTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGT 

FIG. 26-2 
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ValProHisProAsnlleGluGluValAlaLeuSerThrThrGlyGluIleProPheTyr 
1921 CTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCTTTTT 
GACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGAAAAA 

GlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeuIlePheCysHisSer 
1981 ACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGTCATT 
TGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACAG^ 

LysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGlylleAsnAlaValAla 
2041 caaagaagaagtgcgacgaactcgccgcaaaSctggtcgcattgggcatcaatgccgtgg 
gtttctocttcacgctgcotksagcggcgtttcgaccagcgtaac^ 

TyrTyrArgGlyl^uAspValSerVallleProThrSerGlyAspValValValValAla 
2101 CCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGG 
GGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCA^ 

ThrAspAlaLeiiMetThrGlyTyrThrGlyAspPheAspSerVallleAspCysAsnThr 
2161 CAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAG^ 

GTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTA 

CysValThrGlnThrValAspPheSerLeuAspProThrPhelSirlleGluThrlleT^ 
2221 CGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTT^^ 

GCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGT 

LeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThrGlyArgGlyLysPr^ 
2281 CGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGGAAGC 
GCGAGGGG(5TCCTACGACAGAGGGCGa?GAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCG 

GlylleTyrArgPheValAlaProGlyGluArgProSerGlyMetPheAspSerSerVal 
2341 CAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTCGTCCG 
GTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGC 

LeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeuThrProAlaGluTh^ 
24 01 TCCTCTGTGAGTGCTATCACGCAGGCTGTGCTTCGTATGAGCTCACGCCCGCCGAGACT^ 
AGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTCTGAT 

ValArgLeiiArgAlaTyrMetAsnThrProGlyLeuProValCysGlnAspHisLeuGlu 
2461 CAGTTAGGCTAGGAGCGTACATCAACACCCCGGGGCTTCCCGTGTGCCAGGACCATCTTG 
GTCAATCCGATGCTCGCATGTAGTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTAGAAC 

PheTrpGluGlyValPheThrGlyLeuThrHisIleAspAlaHisPheLeuSerGlnThr 
2521 AATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATAaaVGATGCCCACTTTCiy^TC 

TTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGGGTCT 

LysGlnSerGlyGluAsiiLeuProTyrLeuValAlaTyrGlnAlaT^ 
2581 CAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGCGCTA 
GTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACGCG^^ 

AlaGliiAlaProProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeuLysPro 
264 1 GGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTCAAGC 
CCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAGTTCG 

ThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGluIleThr 
2701 CCACCCTCCATGGGCCAACACCCCTGCTATACAGACXGGGCGCTGTTCAGAATGAAATCA 
GGTGGGAGGTACCCGGTTGTGGGGACGAXATGTCTGACCCGCGACAAGTCTTACTTTAGT 

LeuXhrHisProValThrLysTyrlleMetllirCysMetSerAlaAspI^uGluVal^^ 
2761 CCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAGGTCG 
GGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTCCAGC 

ThrSerThrTrpVall^uValGlyGlyValLeuAlaAlal^iiAlaAlaTyrt 
2821 TCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGT 
AGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACA 

ThrGlyCysValVallleValGlyArgValValLeuSerGlyLysProAlallellePro 
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2881 CAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATAC 
GTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCra 

AspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuPro 
2941 CTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTAC 
GACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATG 

TyrlleGluGlnGlyMetMetlieiiAlaGluGlnPheLysGlnLysAlaLeuGlyLeuL^ 
3001 CGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTC2U^GCAGAAGGCCCTC^^ 

GCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 

GlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLys 
3051 TGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGC^^ 
ACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTGTGGTTGACCGTOT 

LeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAla 
3121 AACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGG 
TTCAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGT^^ 

GlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAla 
3181 CGGGCTOXSTCAACGCaXKICTGGTAACCCCGCCATTGCTTCATTGATC^ 

GCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGAC 

ValThrSerProIieuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGlyTrpV^ 
3241 CTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGGTGGG 
GACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCCACC^ 

AlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeuAlaGly 
3301 TGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTG 
ACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGAC 

AlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGlyTyrGl^ 
3361 GCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATC 
CGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCCA^ 

AlaGlyValAlaGlyAlal^uValAlaPheLysIleMetSerGlyGluValProSerThr 
3421 GOSCGGGCGTGGCGGGAGCTCTTCTGGCATTCAAGATCATGAGCGGTGAGGTCCCCTCCA 
CGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGGAGGT 

GliiAspLeuValAsnLeuLeuProAlalleLeiiSerProGlyAlaLeuValValGlyVal 
3481 CGGAGGACCTGGOXIAATCTACTGCCCGCCArcCTCTCGCCCGGAGCCCTCGXAGTCGGC^ 
GCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGC 

ValCysAlaAlalleLeiaArgArgHisValGlyProGlyGluGlyAlaValGlnTrpMet 
3541 TGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGA 
ACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGI^ 

AsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProThrHisTyrValPro 
3601 TGAACCGGCTGATAGCCTTCGCCOJCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGC 
ACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCACG 

GluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerlieuThrValThrGlnLeu 
3661 CGGAGAGCGAOXSCAGCTGCCCGCGTCACTGCCATACTC AGCAGCCTCACTGTAACCCAGC 
GCCTCTCGCTACGTC^CGGGCGCAGTGACGGTATGAGTCGTCGGAGaXSAC^^ 

LeuArgArgLeiiHisGlnXrpIleSerSerGluCysThrThrProCysSerGlySerTirp 
3721 TCCTGAGGCX3ACTCCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTC 

AGGACTCCGCOX^ACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAGGA 

LeiiArgAspIleTrpAspTrplleCysGluValLeuSerAspPheLysThrTrpLeuLys 
3781 GGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCTAA 
CCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGA 

AlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGlyTyrLys 
3841 AAGC1!AAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTATA 
TTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCATAT 
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GlyValTrpArgValAspGlylleMetHisThrArgCysHisCysGlyAlaGluIleThr 
3^01 AGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTG6AGCTGAGATCA 
TCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGOXSACACCTCGACTCTAGT 

GlyHisValLysAsnGlyThrMetArglleValGlyProArgThrCysArgAsnMetTrp 
3961 CTGGACATGTCAA2^CGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAACATGT 
GACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTTGTACA 

SerGlyThrPheProIleAsnAlaTyrThrTlirGlyProCysTlirPr^ 
4021 GGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCTGCGC 
CCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGACGCG 

AsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyrValGluIleArgGl^^ 
4081 CGAACTACACGTTCGCGCTATGGAGGGTGTCa?GCAGAGGAATATGTGGAGATAAGGCAGG 
GCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCCGTC^ 

GlyAspPheHisTyrValThrGlyMetThrThrAspAsiiLeuLysQirsProC^ 
4141 TGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCC^^ 

ACCCCCTGAAGGTGATCCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACGGTCC 

ProSerPrc5GluPhePheThrGluLeiiAspGlyValArgLeuHisArgPheAlaPTO 
4201 TCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGOT 

AGGG^EAGCGGGCTTAAA2VAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGCGGGG 

CysLysProLexiLeuArgGluGluValSerPheArgValGlyIieiiHisGluT3rrProVal 
4261 CCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATACCCGG 
GGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTGATCCTGAGGTGCTTATGGGCC 

GlySerGlnIieuPrc<:ysGlxiProGluProAspValAlaValLeuThrSerMetLeuThr 
4321 TAGGGTCGCAATa!ACCa?TCCGAGCCCGAACCGGACGTCGCCGTGTTGACGTCCATC 

ATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCOJGCACCGGCACAACTGCAGGTACG^ 

AspProSerHisIleThrAlaGltiiaaAlaGlyArgArglieuAlaArgGlySerProPro 
4381 CTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATCAC^^ 
GACTAGGGAGGGXATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGG 

SerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThr<:ysTte 
4441 CCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTAOXICGCTCCATCTCtCAAGGCAAC^ 

GGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGT • 

AlaAsnHisAspSerProAspAlaGluLeuIleGltiAlaAsnLeuLeuTrpArgGlnG^^ 
4501 CCGCTAACCAXGACTCCCCOX^ATGCTGAGCTCATAGAGGCCAACCTCCTATGC^ 

GGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCOSGTTGGAGGATACCTC^ 

MetGlyGlyAsnlleThrArgValGluSerGluAsiiLysValV'allleLeiaAspSerPhe 
4561 AGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTCGTGATTC^ 
TCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTT^ 

AspProLeuValAlaGluGluAspGlxiArgGluIleSerValProAlaGluIleLeuArg 
4 621 TCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGC 
AGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACG 

LysSerArgArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsnProPro 
4681 GGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCC 
CCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGG 

LeuValGluThrTrpLysLysProAspTyrGluProProValValHisGlyCysProIieu 
4741 CGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGC 
GCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCG 

ProProProLysSerProProValProProProArgLysLysArgThrValValLeuThr 
4801 TTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTCGTCCTC^ 
AAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGT 

GluSerThrl^uSerThrAlaLeuAlaGluLeuAlaThrArgSerPheGlySerSerSer 

FIG. 26-5 
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4861 CTGAATCiUVCCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGC^^ 

GACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGA 

ThrSerGlylleThrGlyAspAsnThrThrThrSerSerGluProAlaProSerGlyCys 

4921 caacttccggcattacgggcgacaatacgacaacAtcctctgagcccgccccct 
gttgaaggccgtaatgcccgcxgttatcctgragtaggagactcgggcgggga^^ 

ProProAspSerAspAlaGluSerTyrSerSerMetProProLeuGluGlyGluProGly 
4981 GCCCCCCCGACTCCGACGCXGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTG 
CGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTCGGAC 



GluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerl^uLeuArgHisHisAsnLeu 
5161 OKAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTA^ 
GCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCA^ 



5041 




5101 



ValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeuValThrProCysAlaAla 
ATGTCGTGTGCTCCTCAATGTCTTACTCTTGGACAGGCGCACTCGTC^^ 
TACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACGC^ 



5221 



ValTyrSerOlirThrSerArgSerAlaCysGliiArgGlnLysLysValThrPheAsp 
TGGTGTATTCGACCACCTCACGCAGTGCXTGCCAAAGGCAGAAGAAAGTCACATTTC 7 
ACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAAACTGT 



5281 




5341 



LysValLysAlaAsnLeu 
CAAAAGTGAAGGCTAACTTG 
GCTXTCACTTCCGATTGAAC 



FIG. 26-6 
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FIG. 27 Translation of DNA 12f 

IlePheLysIleArgMetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsn 
1 CCATATTTAAAATCAGGATGTACGOXSGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT 

TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT 
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA 

LeiiLeuThrThrThrGlnTrpGlnValLeuProCysSerPheTlirThrLeuProAlaLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCT 
ATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGA 

S erThrGly Leu IleHisLeuHi sGlnAsn IleVal AspValGlnTyrLeuTyrGly Val 
181 TCTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCC 



GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValValLeuLeuPhel^uLeu 
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 



LeuAlaAspAlaArgValCysSerCysLeuTrpKetKetLeuLeuIleSerGlnAlaGlu 
301 TGCTTCCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCC^ 

ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

Overlap with 14i 

AlaAlaLeuGluAsnLeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu 
361 AGGCGGCTTTGGAGAACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 



Val 
421 TTGTATC 
AACATAG 
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FIG. 28 Translation of DNA 35f 



61 



121 



181 



241 



301 



361 



-Overlap with 39c- 



LeuLysGluValI*ysAlaAlaAlaSerLysValLysAlaAsnLeuI,eiiSerValGluGlu 
TGCTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGG 
ACGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCC 

AlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAsp 
AAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTCGTTATGGGGCAAAA 
TTCGAACeTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTC 

ValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeu 
ACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTC 
TCCAGGCAACGGTACGGTCTTTCCGGCATTCGGTGTAGTTGAGGCACACCTra^ 

GliiAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysVal 
TGGAAGACAATGTAACACC?VATAGACACTACCATCATGGCTAAGAACGAGGTTTT^ 
ACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGC 

GlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyVal 
TTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCG 
AAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGC 

ArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMet 
TGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGA 
ACGCGCACACGCTTTTCTACGGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGC^^ 

GlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAla 
TGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAG 
ACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTCTCGCCCAACTTAAGGAGCACGTTC 

TrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThr 
421 CGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCA 
GCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGT 

ValThrGluSerAsp IleArgThrGluGluAla 
481 CAGTCACTGAGAGCGACATCCGTACGGAGGAGGCA 
GTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGT 
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FIG. 29 Translation of DNA 19g 



GluPheLeuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThr 
1 GAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGAT^^ 
CTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGG 

Overlap with 35f 

ArgCysPheAspSerThrValThrGluSerAspIleArgThrGluGliiAlalleTy^ 
6 1 CGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAA 
GCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTT 

CysCysAspLeuAspProGlnAlaArgValAlall^ysSerLeuThrGluArgLeuTyr 
121 TGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCgTCACCGAGAGGCTTTAT 
ACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATA 

ValGlyGlyProLeuThrAsnSerArgGlyGlxiAsnCysGlyTyrArgArgCysArgAla 
181 GTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGGAGGTGCCGCGCG 
CAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGC 

SerGlyVall^uThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAla 
241 AGCGGCGTACTGACAACTAGCTGTCGTAACACCCTCACTTGCTACATCAAGGCCCGGGCA 
TCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCC^ 

AlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuVal 
301 GCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTAGTC 
CGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAG 

VallleCysGluSerAlaGlyValGlnGluAspAlaAla 
361 GTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCGAG 
CAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTC 
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FIG. 30 Translation of DNA 26g 



GlyGlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeuThrThrSerCys 
1 GGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCAAGCGGCGTACTGACAACTAGCTGT 
CCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGTTCGCCGCATGACTGTTGATCGACA 



GlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGln 
61 GGTAACACCCTCACTTGTTACATCAAGGCCCGAGCAGCCTGTCGAGCCGCAGGGCTCCAG 
CCATTGTGGGAGTGAACAATGTAGTTCCGGGCTCGTCGGACAGCTCGGCGTCCCGAGGTC 

— ■■ — Overlap with 19g • — 

AspCysThrMetLeuValCysGlyAspAspLeuValVallleCysGluSerAlaGlyVal 
121 GACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTC 
CTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAG 



GlnGluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaPro 
181 CAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCC 
GTCCTCCTGCGCCGCTCGGACTCTCGGAAGOXSCCTCCGATACTGGTCCATGAGGCGGGGG 

ProGlyAspProProGlnProGluTyrAspLeuGloLeuIleThrSerCysSerSerAsn 
241 CCTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAAC 
GGACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTG 

ValSerValAlaHisAspGlyAlaGlyLysArgValTyrTyrLeuThrArgAspProThr 
301 GTGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACA 
CACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATG 

ThrPror,eiiAlaArgAlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeu 
361 ACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTA 
TGGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGAT 

GlyAsn IlelleMetPheAlaProThrLeuTrpAla 
421 GGCAACATAATCATGTTTGCCCCCACACTGTGGGCG 
CCGTTGTATTAGTACAAACGGGGGTCTGACACCCGC 

FIG. 31 Translation of DNA 15e 



GlyAlaGlyLysArgValTyrTyrlieuThrArgAspProThrThrProLeuAlaArgAla 
1 CGGCGCTGGAAA6AGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGC 
GCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCTCG 

Overlap with 26g 

AlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeuGlyAsnllelleMetPhe 
6 1 TGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATGTT 
ACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAA 



AlaProThrLeuTrpAlaArgMetlleLeuMetThrHisPhePheSerValLeuIleAla 
121 TGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATAGC 
ACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCG 

ArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerlleGlu 
181 CAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATAGA 
GTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCT 

ProLeuAspLeuProProIlelleGlnArgLeu 
241 ACCACTTGATCTACCTCCAATCATTCAAAGACTC 
TGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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FIG. 32*1 COMBINED ORF OF DMAs 12f through 15e 



IlePheLysIleArgMetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsn 
1 CCAOIATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT 

TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGC^ 

TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA 

LeuLeuThrThrThrGlnTrpGlnValLeuProCysSerPheThrThrLeiiProAlaLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACC^^ 

Aa?GACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGG 

SerThrGlyLeuIleHisLeuHisGlnAsnlleValAspValGlnTyrLeuTyirGlyVal 
181 TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACITGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGOX^GTCTTCTAACACCTGCACGTCATGAACATGCCCC 

GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValValLeuLeuPheLeuLeu 
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 

LeuAlaAspAlaArgValCysSerCysLeuTrpMetMetJlieuLeuIleSerGlnAlaGlu 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGG 
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

AlaAlaLeuGluAsnLeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu 
361 AGGCGGCTTTGGAGAACCTCGTAATACraAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 

ValSerPheLeuValPhePheCysPheAlaTarpTyrLeuLysGlyLysTrpValProGly 
421 TTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGGGTGCCCG 
AACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCC^CG^ 

AlaValTyrThrPheTyrGlyMetTrpProLeuLeuLeiiLeuLeuLeiiAlaLeuProGl 
481 GAGCGGTCTACACCTTCTACGGGATGTGGCCTCTCCTCCTGCOXZCTGTTCGCGTTGCCCC 
CTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGG 

ArgAlaTyrAlaLeuAspThrGluValAlaAlaSerCysGlyGlyValValLeuValGly 
541 AGCGGGCGTACGCC3CTGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCG 
TCGCCCGCATGCGCGACCTGTCCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAG 

LeuMetAlal^uThrLeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrp 
601 GGTTGATGGCGCTGACa?CTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGT^ 
CCAACTACCGCGACTGAGACAGTGGTATAAOXSTTCGCGATATAGTCGACCACGAACACCA 

LeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsn 
661 GGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCA 
CCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGT 

ValArgGlyGlyArgAspAlaVallleLeuLeuMetCysAlaValHisProThrLeuVal 
721 ACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGG 
OXSCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACC 

PheAspIleThrLysLeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAlaSer 
781 TATTTGACATCACCAAATTCCTGCTCGCCGTCTTCGGACCCCTTTGGATTCTTC^ 

ATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGT 

LeuLeuLysValProTyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeviAla 
841 GTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAG 
CAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATC 

ArgLysMetlleGlyGlyHisTyrValGlnMetValllelleLysLeuGlyAlaLeuThr 
901 CGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCTTA 
GCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATOCAATCCCCGCGAAT 
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GlyThrTyrValTyrAsnHisIieuThrProLeiiArgAspTrpAlaHisAsnGlyLexiteg 
961 CTGGCACCTATGTTTATAACCAXCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGC 
GACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACG 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
1021 GAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCA 
CTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGT 

TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
1081 CGTGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTOT 

GCACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGG 

ArgGlyArgGluIleLeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeu 
114 1 GCAGGGGCCGGGAGATACOXSCTCGGGCCAGCCGATdGAATGGTGTCCAAGGGGTGGAGGT 
CGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCA 

LeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIlelleThr 
1201 TGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCA 
ACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGT 

SerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGlnlleValSerThirAla 
1261 CCAGCCTAACTGGCCGGGACAAAAACCAAGa?GGAGGGTGAGGTCCAGATTGTGTCAACTG 
GGTCGGATaXSACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGAC 

AlaGlnThrPheLeuAlaThrCysIleAsnGlyVcilCysTrpThrValTyrHisGlyAla 
1321 CaX3CCCAAACCTTCCTGGCAACGaX3CATCAATGGGGTGTGCTGGACTC^ 

GACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCC 

GlyThrArgThrlleAlaSerProLysGlyProVallleGlnMetTyrThrAsnValAsp 
1381 CCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAG 
GGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATC 

GlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerlreuThrProCysThrCys 
1441 ACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTT 
TGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGa?GGGACGTGAA 

GlySerSerAspIjeuTyrLeuValThrArgHisAlaAspValTleProValArgArgArg 
1501 GCGGCTCCTCGGACCTTOACCTGGTCACGAGGCACGCCGATGTGATTCCCGTGCGCCGGC 

CGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCG . 

GlyAspSerArgGlySerLeuLeuSerProArgProIleSera?yrLeuLysGlySerSer 

15 61 GGGGTGATAGCAGGGGCAGCCTGCTGa?CGCCCCGGCCCATTTCCTACTa?GAAAGGCTCCT 

CCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATG2UVCTTTCCGAGGA 

GlyGlyProLexiLeuCysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCys 

16 21 CGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGT 

GCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACA 

ThrArgGlyValAlaLysAlaValAspPKelleProValGluAsnLeuGluThrThrMet 
1681 GCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCOTGTGGAGAACCTAGAGACAACCA 
CGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGT 

ArgSerProValPheThrAspAsnSerSerProProValValProGlnSerPheGlnVal 
174 1 TGAGGXCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCT 

ACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCC 

AlciHisLeuHisAlaProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAla 
1801 TGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATC 
ACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTAT^^ 

AlaGlnGlyTyrLysValLeuValLeuAsnProSerValAlaAlaThrlieuGlyPheGly 
1861 CAGCTCAGGGCTATAAGGOXSCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCT^ 

GTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGTTGTCACCCGAAAC 

AlaTyrMetSerLysAlaHlsGlylleAspProAsnlleArgThrGlyValArgThrlle 

FIG. 32-2 
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1921 GTGCTTACATGTCCAAGGCa?CATGGGATCGATCCtAACAT(^ 

GACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTO? 

ThrThrGlySerProIleThrTyrSerThrTj^lyLysPheLeuAlaAs^ 
a?TACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGa?TCCT!rGCCGACGGCGGGT 
AATGGTGACCGTCGGGGTAGTGCATGAGGTCGATGCCGTTCAAGGAACGGCTGCCGCCC^ 

SerGlyGlyAlaTyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
GCTCGGGGGGCGCTTATGACATAATAATTTGOXSACGAGTGCCACTCCACGGATGCCACAT 
CGAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTA 

lleLeuGlylleGlyThrValLeuAspGlnAlaGluThrAiaGlyAlaArgLeuValV^ 
CCATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTG^ 
GGTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAAC 

LeiiAlaThrAlaThrProProGlySerValThrValProHisProAsnlleGluGluVal 
TGCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCG 
ACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGa?GACACGGGGa!AGGGTTGTAGCTCCTCC 

AlaLeuSerThrTh3:GlyGluIleProPhea?yrGlyLysAlaIleProrieuGluVcaile 
TTGCTCOXSTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGT 
AACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATT 

LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
aXAAGGGGGGGAGACATCTCATCTTCTGOXiyVTTCAAAGAAGAAGTGCGACG^ 
AGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGC 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
CAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCG 
GTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGC 

IleProThrSerGlyAspValValValValAlaa?hrAspAlaLeuMetThrGlyTyr^ 
TCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATA 
AGTAGGGCTGGTCGCaX!a]ACAACAGCAGCACCGITCGCTACGGGAGTA^ 

GlyAspPheAspSerVallleAspCysAsnThrCysValThrGlnThrValAspPheSer 
CCGGCGACTTCGACTCGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATT^ 
GGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGT 

I^uAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
GCCTTGACCCTACCTTCACCATTCAGACAATCACGCPCCCCCAGGATGCTC 
CGGAACTGGGATGGAAGOX^GTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAG^ 

GinArgArgGlyArgThrGlyArgGlyLysProGlyllel^ArgPheValJaaProGl^ 
CTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTCl^ 
GAGTTCCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACAC 

GluArgProSerGlyMetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCys 
GGGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGC^^ 
CCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGA 

MaTrpTyrGliiI^uThrProAlaGluThrThrValArgLeuArgAlaTyrMet^ 
GTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAA 
CACGAACCATACTCGAGTGCGGGCGGCaxnJGATGOXIAATCCGATGCTCGCATC 

ProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeu 
CCCCGGGGCTTCCCGTGTGCCAGGACCATGTTGAATTTTGGGAGGGCGTCTTT^ 
GGGGCCCCGAAGGGCACACGGTCCTGGTAGAACOTAAAACCCTCCCGCAGAAATGTCCGG 

ThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyr 
TCACTCATATAGATGCCCACmTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTT 
AGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAA 

LeuValAlaTyrGlnAlaThrValCysAl.aArgAlaGlnAlaProProProSerTrpAsp 
ACCTGGTAGCGTACCAAGCCACCGTGO^GCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGG 
TGGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCAC^^ 
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GlnMetTrpIiysCysLeuIleArgLeuLysProThrl^uHisGlyProThrProLeii^ 
2941 ACCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTC 
TGGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTC 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
30Q1 TATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATAC^ 
ATATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGT 

MetThrCysMetSerAlaAspIieuGluValValThrSerThrTrpValLeuValGl 
3061 TCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGC 
AGTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGC 

ValLeuAlaAlaLeuAlaAlaTyrC^-sLeuSerThrGlyCysValVallleValGl 
3121 GCGTCCTGGCTGCTTTGGCOSCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTG^ 

CGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGT 

ValValLeuSerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPhe 
3181 GGGTCGTCTOXSTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGT 
CCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCA 

AspGliaMetGluGluCysSerGlnHlsLeuProTyrlleGluGlnGlyMetMetLeiiM 
3241 TCGATGAGATGGAAGAGTGCTCTCAGCACTXACCGTACATCGAGCAAGGGATGATGCTCG 
AGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGC 

GluGlnPheLysGlnLysAlalieuGlyLeuIieuGlnThrAlaSerArgGlnAlaGluVal 
3301 CCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGG 
GGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCC 

IleAlaProAlaVcaGlnThrAsnTrpGlnLysIieuGluThrPheTrpAlaLysHisMet 
3361 TTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACa?CGAGACCTTCTGGGCGAAGCATA 
AATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTAT 

TrpAsnPheIleSerGlyIleGlnTyrI.euAlaGlyIieuSerThrLeuProGlyAsnPro 
3421 TGTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAAC^ 
ACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGG 

AlaIleAlaSerI^iiMeiAlaPhel±LrAlaAlaValThrSerProI^uThrThrSerGln 
3481 CCGCCATTGCTTCATTCATGGCTTTTACAGCTGCTGTCACCAGCCC^^ 

GGCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGG 

ThrLeuIeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeiiAlaAlaPrc^ 
3541 AAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCGAGCTCGCCGCCCCCGGTG 
TTTCGGAGGAGAAGTa?GTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCAC 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 
3601 CCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTCGACTC^ 
GGCGATGACGGAAACACCCGCGACOSAATCGACCGCGGCGGTAGCCGTCACAACCTGAC^ 

LysValLeulleAspIleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAla 
3661 GGAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGG 
CCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACC 

PheLysIleMetSerGlyGluValProSerThrGliiAspLeuYalAsnLeiiLeuProAla 
3721 CATTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCG 
GTAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGC 

IleLeiiSerProGlyAlaLeuValValGlyValValCysAlaAlalleLeuArgArgHis 
3781 CCATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGC 
GGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCG 

ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeulleAlaPheAlaSerArg 
3 84 1 ACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCC 
TGCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGG 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 
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3901^ GGGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCA 
CCCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGC^ 

AlalleLeuSerSerl^uThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSer 
3961 CTGCCATACTCAGCAGCCIIXACTGTAACCCAGCTCCTGAGGCGACTGC^^ 

GACGGTATGAGTCGTCGGAGTGACATTCGGTCGAGGACTCCGCTCACGTGGTCACCTAra 

SexGluCysThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCys 
4021 GCTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATAT 
CGAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCUATA 

GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLexiMetProGlnLeuProGly 
4081 GCGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTG 
CGCTCCACiyvCTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTC^ 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
4141 GGATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCA 
CCTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGT 

HisThrArgCysHlsCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArg 
4201 TGCACACTCGCTGCCACTGTCGAGCTCAGATCACa?GGACATGTC^^ 

ACGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACT 

IleValGlyProArgThrC^sArgAsnMetTrpSerGlyThrPheProIleAsnAlaTy^ 
4261 GGATCGTCGGTCCTAGGACCTGCAGGAACATGTCGAGTGGGACCTTCCCCATTAATGCCT 
CCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGG 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
4321 ACACCACGGGCCCCTGOIACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGG 
TGTGGTGCCCGGGGACAOXSGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCC 

SerAlaGluGluTyrVeilGluIleArgGlnValGlyAspPheHisTyrValThrGlyMet 
4381 TGTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTA 
ACAGACGTCTCCTTATACACCTCOa^TTCCGTCCACCCCCTGAAGGTGATGCACTGCCC^ 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGliiLeu 
4441 TG ACTACTG ACAATCax::AAATGCCCGTGCCAGGTCCCATCGCCCG AATTTTTCA 

ACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGtGTCTTA 

AspGlyValArgl^uHisArgPheAlaProPrcKZysLysProLeuLeuArgGluGluVal 
4501 TGGACGGGGTGCGCCaiACATAGGa?TTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGG 
ACCTGCCCCACGCGGATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCC 

SerPheArgValGlyLeuHisGluTyrProValGlySerGlnLeuProCysGluProGlu 
4561 TATCATTCAGAGTAGGACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCG 

ATAGTAAGTCTCATCCTGAGGTGCTTATGGGCGATCCCAGCGTOTATGGAACGCTCQ 

ProAspValAlaValLeuThrSerMetLeuThrAspProSerHisIleThrAlaGltiAla 
4621 AACCGGACGTGGCCGTGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGC^^ 
TTGGCCTCCACCGGCAC2^CTGCAGGTACGAGTGACTAGGGAGGGTATATTCTCGT^ 

AlaGlyArgArgLetiAlaArgGlySerProProSerValAlaSerSerSerAlaSerGln 
4681 CGGCCGGGCGAAGGTTGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTA^ 
GCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGG 

LeiiSerAlaProSerLexiLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGlu 
4741 AGCTATCCGCTCCATCTCTCAAGGCAACTTCCACCGCTAACCATGACTCC^ 

TCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTCGCGATTGGTACTGAGGGGACTACGAC 

LeuIleGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlyAsnlleThrArgValGlu 
4801 AGCTCATAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTG 
TCGAGTATCTCCGGTa?GGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAAC 

SerGluAsnLysValVallleLeuAspSerPheAspProLeuValAlaGluGluAspGlu 
4861 AGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACG 
rcAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTC^ 
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ArgGluIleSerValProAlaGluIleLeuArgLysSerArgArgPheAlaGlnAlaLeu 
4921 AGCGGGAGATCTCCGTACGCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCC 
TCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGG 

ProValTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysProAsp 
4981 TCCCCGITTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCG 
ACGGGCAAACCCGCGCCGGCCTGATATOXSGGGGGCGATCACCTCTGCACCTTTTTCGGGC 

TyrGluProProValValHisGlyCysProLeuProProProLysSerProProValPro 
5041 ACTACGAACCACCTGTGGTCCATGGCTGTCCQCTTCCACCTCCAAAGTCCCCTCCTGTGC 
TGATGCTOSGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACG 

ProProArgLysLysArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAla 
5101 CTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGG 
GAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACC 

GluLeiiAlaThrArgSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThr 
5161 CCGAGCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATA 
GGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTAT 

ThrThrSerSerGluProAlaProSerGlyCysProProAspSerAspAlaGlxiSerTyr 
5221 CGACAACATCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCG ACGCTGAGTCCT 
GCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGA 

SerSerMetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrp 
5281 ATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCAT 
TAAGGAGGTiVCGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTA 

SerThrValSerSerGluAlaAsnAlaGltiAspValValCysCysSerMetSerTyrSer 
5341 GGTCAACGGTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTOTTACT 
CCAGTTGCCAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGA 

TrpThrGlyAlaLeuValThrProCysAlaAlaGluGluGlnLysLeuProlleAsnAla 
5401 CTTGGACAGGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATG 
GAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTAC 

LeuSerAsnSerl^iiIieuArgHisHisAsnlieuValaYrSerThrThrSerArgSerAla 
5461 CACTAAGCAACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
GTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 

CysGlnArgGlnLysLysValThrPheAspArgLeuGlnValLeuAspSerHisTyrGln 
5521 CTTGCCAAAGGCAGAAGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCA^^^ 
GAACGGTraCCGTCTTCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAAT 

AspValLeiiLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLexiSerVal 
5581 AGGACGTACTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCG 
TCCTGCATGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGC 

GluGliiAlaCysSerLeuThrProProHlsSerAlaLysSerLysPheGlyTyrGlyAla 
5641 TAGAGGAAGCTTCCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATC^ 

ATCTCCTTCGAACGTCGGACTGCGGGGGXGTGAGTCGGTTTAGGTTCAAACCAATACCCC 

LysAspValArgCysHisAlaArgriysAlaValThrHisIleAsnSerValTrpLysAsp 
5701 CAAAAGACGTCCGTrKKICATGCCAGAAAGGCCGTAACCCACATC^ 

GTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTC 

LeiiLeuGluAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPhe 
5761 ACCTTCTGGAAGACIAATGTAACACCAATAGACACTACCATCATGGCTAAGAAC^ 

TGGAAGACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAA 

CysValGlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeu 
5821 TCTGCGTTC AGCCTGAGAAGGGGGGTCGTAAGCCAGiCTCGTCTCATCGTGTTCCCCGATC 
AGACGCAAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAG 

GlyValArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAla 
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5881 TGGGCGTGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCT^ 
ACCCGGACGCGCACACGCTTTTCTACCGAAACATGCTGC^CCAATCTTTC^ 

ValMetGlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuV 
5941 CCGTGATGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTC^ 
GGCACTACCCTTCGAGGATGCCTl^GGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGC 

GlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThrZ^rgCysPheAsp 

6001 TGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTG 
ACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAAC 

SerThrValThrGluSerAspIleZ^gThrGluGliiAlalleTyrGlnCysCysA^p^ 
6061 ACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTC 

TGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGAa?GGTTACAACACax;G 

AspProGlnAlaArgValAlalleLysSerLeuThrGluArgLeuTyrValGlyGlyPro 
6121 TCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATGTTGGGGGCC 
AGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGG 

LeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeu 
6181 CTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTAC 
GAGAATGGTa?AAGTTCCCCCCTCTa?GACGCCGATAGCGTCCACGGCGCGCTCGCCGCATG 

ThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAla 
6241 TGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCT^ 

ACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTC 

AlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuValVallleCysGlu 
6301 CCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTC 
GGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACAC 

SerAlaGlyValGlnGluAspAlaAlaSerLeiiArgAlaPheThrGluAlaMetThrArg 
6361 AAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCA 
TTTCGCGCCCCCAGGTCCTCCTGCGCGGCTCGGACTCTCGGAAGTGCCTCCGATACTGGT 

TyrSerAlaProProGlyAspProProGlnProGluTyrAspLeuGltiLeuIleThrSer 
6421 GGTACTCCGCCCCCCCTGGGGACCCCCCACZ^CCAGAATACGACTa?GGAGCTCATAACAT 
CCT^TGAGGCGGGGGGGACCCCOXSGGGGGTGTTGGTCTTATCCTCAACCTCGAGTAra 

CysSerSerAsaValSerValAlaHlsAspGlyAlaGlyLysArgValTyrTyrLeuThr 
6481 CAOXSCTCCTCCAACGTGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACT^ 
GTACGAGGAGGT3X5CACAGTCAGCGGGT<K:TGCCGCGACCTTTCTCCCAGATGATC 

ArgAspProThrThrProLeuAlaArgAlaAlaTrpGluThrAlaArgHisThrProVal 
6541 CCCGTGACCCTACAACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAG 
GGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTC^ 

AsnSerTrpLeuGlyAsnllelleMetPheAlaProThrLeuTrpAlaArgMetlleLeu 
6601 TCAATTCCTCGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATAC 
AGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTGACACCCGCTCCTACTATG 

ItetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGluGlnAlaLeuAspCy 
6661 TGATGACCCATTTCTa?TAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTCGATT 
ACTACTGGGTAAAGAAATCGGAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCTAA 

GluIleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuProProIlelleGlnArg 
6721 GCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCTACCTCCAATCATTCAAA 
CGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGt;^^ 

Leu 

6781 GACTC ^ p,^ 
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AcuteHAV 
AcuteHAV 
AcuteHAV 
AcuteHAV 
AcuteHAV 
AcuteHAV 

Convalescent HBV 
(anti-HBSag-t-ve; 

anti-HBCag+ve) 
(anti-HBSag+ve; 

anti-HBCag+ve) 
(anti-HBSag+ve; 

antl-HBCag*i-ve) 
( anti-HBSag+ve; 

ant i-HBSag+ve ) 



1354 
31 
14 
79 
26 
78 
87 
25 
60 
13 
298 
101 
474 
318 
20 
163 
44 
SO 
N/A 
N/A 
N/A 

N/A 
N/A 
N/A 

N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 
N/A 



Sequential serum samples were assayed from these patients 
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3 6*1 COOH-terminus of SOD-ClOO Fusion Polypeptide 

SOD — CCX)H] [ — adaptot 1 [NANBHpolypeptide> 

AlaCysGlyVallleGlylleAlaGlnAsnLeuGlylleArgAspAlaHisPhelLeuSer 
1 GCTTGTGGTGTAATTGGGATCGCCCAGAATTTGGGAATTCGGGATGCCCACTTTCTATC^ 
CGAACACCACATTAACCCTAGCGGGTCTTl^CCCTTAAGCCCTACGGGTGAAAGAa^ 

>>>>>>>>>>>>>>>>>>>>> 

GlnThrLysGlnSerGlyGliiAsnl^tiProTyrLeuValAlaTyrGlnAlaTta 
61 (»GACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGC 
GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGliiAlaProProProSerTrpAsp61nMetO?rpLysCysLeuIleArgLeu 
121 GCTAGGGCTCi^GCCCCTCCCCCATCGTGGGACCAGATCTGGAAGTGTTTG^ 

CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAI^CTAA^ 

LysProThrLeuHisGlyProThrPrdLeuLeuOJyrArgLeuGlyAlaVea 
181 AAGCCCACCCXCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAA 
TTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTAC^ 

IleThrLeuThrHisProValThrLysTyrll^etThrCysMetSeri^ 
241 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTG 

TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTC 

ValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeu^ 
301 GTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGC 
CAGCAGOKXITCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGC^ 

I^iiSearThrGlyCysValVallleVeaGlyArgValValLetiSerGlyLysProAlalle 
361 CTGTCAACAGGCTGCGTGGTCATAGOX^GGCAGGGTCGTCTTGTCCGG^ 

GACAGTTGTCaSACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTC 

IleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGltiHis 
421 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTCCTC^ 
TATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCT^ 

LetiProa?yrIleGluGlnGl5^etMetLeiiAlaGluGlnPheLysGlnLysAlaX,eu^ 
481 TTACCGTACATCGA6CAAGGGATGATGCTCGCCGAGCAGTTC 
AATGGCATGTMCTCGTTCCCTACaaVCGAGCGGCTCGTC^ 

LeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaPTO 
54 1 CTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATGGCCCCTCCTGTCCA^ 

GAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGT^ 

GlnLysLeuGluanirPheTrpAlaLysHisMetTrpAsnPheIleSe3:GlyIleGlnTy^ 
601 CAAZ^CTCGAGACCnrrCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGAT^ 
GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACC^ 

LeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMet^ 
661 TTG6C66GCTTGTCAACGCTGCCT6GTAACCCC6CCATTGCTTCA 

AACCGCCCGAAO^TTGCGACGGACCZmXSGGGCGGTAACGAAGTAACTACC^ 

Ala2U.aValThrSerProLeuThrThrSerGlnThrIieuLeuP^ 
721 GCTGCTGTCACCAGCCCACTAACCACaaGCCAAACCCTCGTCTTC^^ 

CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTCGGAGGAGAAGTTGTATAACCrc 

TrpValAlaAlaGlnLeuAlaAlaProGlyAlaAiaThrAlaPheValGlyAlaGlyLeu 
781 TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 

AlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 
84 1 GCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGG 
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cgaccgcggcggtagccetcacaacctgaccccttccaggagtatctgtaggaa^^^ 

TyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGluValP 
901 TATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGG 

ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGC^^ 

SerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValVal 
961 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTC 
AGGTGCCTCCTGGACCaGTT2W3ATGACGGGCGGTAGGAGAGCGGGCCnx:GGGAGCa^^ 

GlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAlaValGln 
10 21 GGCGTGGTCTGTGCAGCAATACroCGCCGGCACGTTGGCCCGGGCGA 

CCGCACCAGACACGTCGTTATGACGCGGCCGT6CAACCGGGCCCGCTCCCCCGTCACGTC 

<<<<<«<<<<<<<<<<«<NANBH] [ extra 

TrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProVcaHisHis 
1081 TGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCAGTCCATCAT 
ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGTCAGGTAGTA 

] 

LysArgOP 

1141 AAGCGTTGACGCTCCCTACGGGTGGACTCTGGAGAGACAGGGCACTGCTAAGGCCC^^ 
TTCGCAACTGCGAGGGATGCCCACCTGACACCTCTCTGTCCCGTCACGAO!^ 

1201 CTCAGCCATGCATCGAGGGG1!AC2VATCCGTATGGCCAAC^ 

GAGTCGGTACGTAGCTCCCCATGTTAGGCATACCGGTTGT^ 

1261 TCCTTTCTTOATGGTCCATACCTTAGATGCGTTAGCaTl^ 

AGGAAAGAGCTACCAGGTATGGAATCTACGCAATCGTAATTAGG^^ 



FIG. 36-2 
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FIG. 41-1 

Homology between the HCV polypeptide encoded by combined ORP of clones 
14 i through 39c) ^nd the non-structural protein of the Dengue flavi- 
virus (MNWVDl ) . 

10 20 30 40 50 

HCV EYVVIiLFIiLIiADARV'CSC^ 

MNWVDl AVSFVaiilTG^MSFRDLGRVMVMVGAO^^ 

130 140 150 160 170 180 

60 70 80 90 100 110 

HCV WYLKGKWVPGAVYTFYGMWPTiTtTiT.T.TrAIPQRAYAIiDTEVAASCGGV^ 

MNWVDl TSKELMMTTIGIVLLSQSTIPETILELTDAIiALG^^^ 

190 200 210 220 230 240 

120 130 140 150 160* 170 

HCV KRYISWCLWWLQYPLTKVEAQLHVWIPPLNVRGGRDAVILm 

• • • • 

MNWVDl NAVILQNAWKVSCTIIJWVSVSpiiiTSSQQKADra 

250 260 270 280 290 



C 



HCV 

MNWVDl 

HCV 
MNWVDl 

HCV 
MNWVDl 

HCV 
MNWVDl 

HCV 
MNWVDl 

HCV 
MNWVDl 



180 190 200 / 210 220 230 

FCPLWILQASIJ^KVPYF-VRVOGia^-CALARIOIIG 

KKRSWPLNEAIMAVGMVS ILASSIZiKNDIPMTGPLVAGGLLTV^ 
300 310 320 330 340 350 

240 250 260 270 280 290 

TPLRDWAHNGLRDIAVAVEPVVFSQMETKLITWGADTAACGDIING^^ 



ADVK-WEDQAEISGSSPILSITISB-DGSMSIKNEEEEQTLTILIRTGLLVISG LFP 

360 370 380 390 400 410 

300 310 320 330 340 350 

PADGMVSKGWRLrAPITAYAQQTRGIilXSCIITSLTGRDI^^ 

• , 

VSIPITAAAWYLWEVKKQRAGVLWDVPSPPPVGKAELEDGAYRIKQKGiriG 

420 430 440 450 460 470 



360 370 380 390 

INGVCWTVYHGAGTRTIASPKGPVTQMYTNVDQDLV- 



400 410 
—GWPAPQGSRSLTPCTCGSSD 



KEGTFHTMWHVTRGAVLMHKGKRIEPSWADVKKDLVSCGGGWKI^ 

480 490 500 510 520 530 

420 430 440 450 460 470 

LYLVTRHADVlPVRRRGDSRGSLLSPRPISYLKGSSGGPIliCPAGHAVGIFRAAVCT^ 
•* • »» •»» •••• s.s:»:.* * ** 

PGKNPRAVQTKPGLFKTN — ^AGTIGAVSIJDPSPGTSGSPIIDKKGKVVGLYGNGVVTRSG 
540 550 560 570 580 590 

480 490 500 510 520 530 

AKAVDFIPVENTJSTIMEISPVFTDNSSPPVVPQSFQVAHLHAPTGSGKS — TKVPAAYAAQ 



AYVSAIAQTEK— SIEDNPEIEDDIFRK- 
600 610 



-RKLTIMDLHPGAGKTKRYLPAIVRGAIKR 
620 630 640 



540 



550 
ft 



560 



570 



580 
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HCV GYKVLVI^PS— VAAmSPGAYMSKAHG 

MNWVDl GLRiilLAPTRVVAM^ 

650 660 670 680 690 700 

590 600 610 620 630 640 

HCV SGGAYDIIICDECHSTDATSILGIGTVIIXJAETAGARLVVIAT^ 

MNWVDl RW^n^NLIIMDE^^ 

710 720 730 740 750 760 

650 660 670 680 690 700 

HC\r ALSTT6EIPFYGKAIPLEVIKGGRHLIFCHSKKKCDELAAKLVAM 



MNWVDl IMDEEREIPERSWSSGHEWVTDFKGKTVWFVPSIKAGNDTAA<XRK^ 

770 780 790 800 810 820 

710 720 730 740 750 760 

HCV iPTSGDVVVVATDAmTGYTGDFDSVTDCNTGVTQTV^ 

MNWVDl SEYVKTRTNDWNFVVTTDISEMGANFKAERVTDPRRC^ 

830 840 850 860 870 880 

770 780 790 800 810 820 

HOT QRRGRTGRGKPGIYRFVAPGERPSGMFDSSVIiCECYDAGCAWYBLTPAETTVRI^ 

MNWVDl SS 



FIG. 41-2 
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FIG. 43 

DISTRIBUTIOW OF RANDOM SAMPLES 

C100-3 Ag ELISA Preclinical Kit 
416ng C100/WELL. 2 HRS 37^0. 20ul SAMPLE 
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FIG. 44 

Distribution of O.D« Values for 
Random Blood Donor Samples Tested with Two ELISA 

Configurations 

CI 00-3 Ag ELISA MoAB vs Polyclonal 
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FIG. 45 





Common Seouenc© 


Variable 5?^fiiifinr**a 


5'-3-l 


AAGCTTGATCGAATTC 


CGATCTTGC 


-2 




CGATCCTGC 


-3 




CGATCATGC 


-4 




CGATCGTGC 


-5 




CGAAGTTGC 


c 

— D 




t-tiAAGCTGC 


-7 




AGATCTTGC 


-8 




AGATCCTGC 


-9 




AGATCATGC 


-10 




AGATCGTGC 


-11 




AGAAGTTGC 


-12 




AGAAGCTGC 


-13 




CGATCTTGT 


-14 




CGATCCTGT 


-15 




CGATCATGT 


-16 




CGATCGTGT 


-17 " 




CGAAGTTGT 


1 Q 
— ± O 


• 


CGAAGCTGT 


-19 




AGATCTTGT 


-20 




AGATCCTGT 


-21 




AGATCATGT 


-22 




AGATCGTGT 


-23 




AGAAGTTGT 


-24 




AGAAGCTGT 


-25 




CGCTCTTGC 


-26 




CGCTCCTGC 


-27 




CGCTCATGC 


-28 




CGCTCGTGC 


-29 




CGCAGTTGC 


-30 




CGCAGCTGC 


-31 




CGCTCTTGT 


-32 




CGCTCCTGT 


-33 




CGCTCATGT 


-34 




CGCTCGTGT 


-35 




CGCAGTTGT 


-36 




CGCAGCTGT 
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FIG- 46-1 Translation of DNA k9-l 

GlyCysProGluArgLeuAlaSerCysArgProLeuThrAspPheAspGlnGlyTrpGly 
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGAO^GG 

ProLysProCysGlylleValProAlaLysSerValCysGlyProValTyrCysPheThr 
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGtGGTCCGGTATATTGCTTCA 
GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGT 

ProSerProValValValGlyThrThrAspArgSerGlyAlciProThrTyrSerTrpGly 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGGGCCCACCTACAGCTGGG 
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
w CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGC ACTGCCCCACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp 
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGXCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 



TyrProTyrArgLeuTrpHisTyrProCysThrlleAsnTyrThxIlePheLysIleArg 
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACTATATTTAAAATCA 

TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGATATAAATTTTAGT ' 

r MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAGCACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTCGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGG 

^ ArgCy s AspLeuGluAspAr gAspArgS erGluIieuSerProIieuIieuLeuThrThrTh^ 

601 AACGTTGCGATCTGGAAGATAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA 
TTGCAACGCTAGACCTTCTATCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 

GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlal^uSerThrGlyLeuIle 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTGCCAGCCOTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGACGGTCGGAACAGGTGGCCGGAGT 



Overlap with Combined ORF of DNAs 12f through 15e 

HisLexiHisGlnAsnlleValAspValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 



SerTrpAlalleLysTrpGluTyrValValLetil^euPheLeuIieuLeuAlciAspAlaArg 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTCCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAGGAGGACAAGGAAGACGAACGTCTGCGCG 
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ValCysSerCysIieuTrpMety-etLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsn 
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAAGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTTCGCCGAAACCTCT 

LeuVallleLeuAsnAlaAlaSerLeiiAlaGlyThrHisGlyLeuValSerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTCGCCGGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 



PhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPhe 
961 TGrTCTTCTGCTTTGCATGGTATCTGAAGGGXAAGTGGGTGCCCGGAGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAGACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 



TyrGlyMetTrpProI^uLeuLeuLeuXieuLeuAlaLeuProGlnArgAlaTyrAlaLeu 
1021 TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 



AspThrGluValAlaAlaSerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThr 
1081 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTAA 
ACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGC^^ 

LeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrpLeuGlnXyrPheLeu 
114 1 CTCTGTCACCATATTACAAGCGCTATATCAGCTGGOXSCTTGTGGTGGCTTCAGTATTTTC 
GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 



ThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsnValArgGlyGlyArg 
1201 TGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGC 



ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 



AspAlaVallleLeiiLeuMetCysAlaValHisProThrLeuValPheAspIleThrLys 
1261 GCGACGCTGTCATCTTACTCATGOXSTGCTGTACACCCGACTCTGGTATTTGACATCACCA 
, CGCTGCGACAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 



LeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAla 
1321 AATTGCTGCTGGCCGTCTTCGG ACCCCTTTGGATTCTTCAAGCCAG 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTC 



FIG. 46-2 
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FIG 47—1 ORF OF DNAs K9-1 through 15e 

GlyCysProGluArgLeuAlaSerCysArgProLeuThrAspPheAspGlnGlyTrpGly 
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHtsTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysGlylleValProAlaLysSerValCysGlyProValTyrCysPheThr 
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCA 
GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGT 

ProSerProValValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGly 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGG 
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
241 GTGAAAATG ATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp 
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTCGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

TyrProTyrArgLeuTrpHisTyrProCysThrlleAsnTyrThrllePheLysIleArg 
481 ACTACCCGTATAGGCTTTCGCATTATCCXTGTACGATCAACTACACCATATTTAAAATC^ 
TGATGGGGATATCCGAAACCGT2VATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT. 

MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 

ArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThr 
601 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA 
TTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 

GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuIle 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGT 

HisLeuHisGlnAsnlleValAspValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

SerTrpAlalleLysTrpGluTyrValVall^uLeuPheLeuLeuLeuAlaAspAlaArg 
781 CGTCCTGGGCCATt AAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCG 

ValCysSerCysLeuTrpMetMetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsn 
841 GCGTCTGCTCCTGCTTCTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCT 

LeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 
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PhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPhe 
961 TGTTCTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 

TyrGlyMetTrpProI^uLeuLeuLeiiLeixI^uAlaLeuProGlnArgA^ 
1021 TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTOXSCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 

AspThrGluValAlaAlaSerCysGlyGlyVaiValLeuValGlyr^uMetAlaL 
1081 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGA 
ACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGCCC^ 

LeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrpI^uGlnTyrPhe^ 
1141 CTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGG 

GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAA^ 

ThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsnValArgGlyGlyAra 
1201 TGACCAGAGTGGAAGGGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGC 
ACTGGTCTCACCTTCGCGTTGAGGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 

; , AspAlaVallleLeuLeuMetCysAlaValHisProThrLeuValPheAspIleThrLys 

1261 GCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACa^ 
CGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 

, ^ ^ , LeuLeuLeuAlaValPheGlyProI^uTrpIlel^uGlnAlaSerl^iiI^uLysValPro 
1321 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTAC 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATG 

^ « -r ^^P^^ValArgValGlnGlyl^uLeuArgPheCysAlaLeuAlaArgLysMetlleGlv 
1381 CCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCG 
GGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGC 

1441 GAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTT 
CTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAA 

1 can ^^AsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArgAspr^uAlaVa^ 

15 01 ATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGG 

OIATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACC ' 

V^Gl^roValValPheSerGlnMetGluThrLysLeuIleThrTrpGlyAlaA^ 
1561 CTGTAGAGCCAGTCGTCTTCTCCCAAATCGAGACCAAGCTCATCACGTGGGGGGCA^^ 

GACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTAT 

-r ^ ^^aCysGlyAspIlelleAsnGlyLeuProValSerAlaArgArgGlyArgGluIle 
1621 CCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGA 
^ GGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCT 

^ ^^e^GlyProAlaAspGlyMetValSerLysGlyTxpArgI^uLeii2aaProIleThr 

16 81 TACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCA 
ATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGT 

, , , , AlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCys IlelleThrSerLeuThrGlyAra 
1741 CGGCGTACGCCCAGCAGACAAGGGGCCXCCTAGGGTGCATAATCACCAGCCTAAC^ 
GCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATT^ 

^PLysAsnGlnValGluGlyGluValGlnlleValSerThrAlaAlaGlnThrPheLeu 
1801 GGGACAAAAACCAAGTGGAGGGTGiUSGTCCAGATTGTGTCAACT^ 
CCCTGTTTTTGGTXCACCTCCCACTCCAGGTCTAACACAGT^ 

o ^^^^ThrCysIleAsnGlyValCysTrpThrValTyrHisGlyAlaGlyThrArgThrlle 
1861 TGGCAACGTGCATCAAOX^GGGTGTGCTCGACTGTCTACCACGGGGCCGGAACGA^^ 

ACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGT 

AlaSerProLysGlyProVallleGlnMetTyrThrAsnValAspGlnAspLeuValGly 
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1921 TCGCGTCACCCAAGGGTCCTGTOVTCCAGATGTATACCAATGTAGACCAAGACCTTGTGG 
AGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACC 

TrpProAlaProGlnGlySerArgSerLeuThrProCysThrCysGlySerSerAspLeu 
1981 GCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACC 
CGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGG 

TyrLcuValThrArgHisAlaAspVallleProValArgArgArgGlyAspSerArgGly 
2041 TTTACCTGGTCACGAGGCACGCCGAXGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGG 
AAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCC 

SerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSerGlyGlyProLeuLeu 
2101 GCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGT 
CGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACA 

CysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCysThrArgGlyValAla 
2161 TGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGG 
ACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACC 

LysAlaValAspPhelleProValGluAsnLeuGluThrThrMetArgSerProValPhe 
2221 CTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGT 
GATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACA 

OrhrAspAsnSerSerProProValValProGlaSerPheGlnValAlaHisLeuHisAla 
2281 TCACGGATAACa?CCTCTCCACCAGTAGTGCCCCAGAGCa?TCCAGGTGGCTCACCTCCATG 
AGTGCCTATTGAGGAGAGGTGGXCATCACGGGGXCTCGAAGGTCCACCGAGTGGAGGTAC 

ProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLys 
2341 CTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATA 
GAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATAT 

ValLeuVall^uAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyriletSerLys 
24 01 AGGTGCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCA 
TCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCeGAAACCACGAATGTACAGGT 

AlaHisGlylleAspProAsnlleArgThrGlyValArgThrlleThrThrGlySerPro 
2461 AGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCC 
TCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGG 

IleThjrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyr 
2521 CCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTX 
GGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAA 

AspIlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGly 
2581 AOXSACATAATAATTTGTGACGAGTGCCACTCCACGG ATGCCACATCCATCTTCGGCATCG 
TACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGC 

ThrValLexiAspGlnAlaGluThrAlaGlyAlaArgLeuValVall^iiAlaThrAlaThr 
2641 GCACOXSTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCA 
CGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGT 

ProProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThr 
2701 CCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCA 
GGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGT 

GlyGluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHis 
2761 CCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGAC 
GGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTXCATTAGTTCCCCCCCTCTG 

LeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeu 
2821 ATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCAT 
TAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTA 

GlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGly 
2881 TGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTG ACGTGTCCGTCATCCCGACCAGCG 
ACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGC 
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AspValValValValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSer 
2941 GCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACT 
CGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGA 

VallleAspCysAsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPhe 
3001 CGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
GCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAAC^ 

ThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlvA^^ 
3061 TCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGC^ 
AGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGT 

ThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSer^ 
3121 GGACTGGCAGGGGGAAGCCAGGCATCTACAGATTOXSTGGCACCGGGGGAGCGCCCCTCCG 
CCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGC 

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeu 
3181 GCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGC 
CGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCG 

L) ThrProAlaGluThrThrValArgLeiiArgAlaTyrMetAsnThrPrc^^ 

3241 TCACGCCCGCCGAGACEACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCG 
AGTGCGGGCGGCTCTGATGTCAAXCCGATGCTCGCATGTACTTGTGGGGCCCCGAAG 

^ ^ ^ , Of sGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
3301 TGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATG 
ACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC 

, ^ ^ , HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrl^uValAlaa?y^ 
3361 CCCACTTTCXATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACC 
GGGTGAAAGATAGGGTCTGTTTCGTCXCACCCCTCTTGGAAGGAATGGACCATCGCATGG 

, , ^ , AlaThrValCy s AlaArgAlaGliiAlaProProProSerTrpAspGlnMetTrpLy sCys 
3421 AAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT 
TTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCA 

r>A^^ L^^Il^ArgLeuLysProTlurl^iiHisGlyProThrProLeuLeuTyrArgLeuGlvA^ 
3481 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTC^ 

CAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATCTCTGACCCGC ' 

^ , , "V^alGliiAsnGluIleThrLeuThrHisProValThrLysTyrlleMetT 
3541 CTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATC^ 

GACAAGTCTTACXTTAGTGGGACTGCGTGGGTCAGTGGTTTATCTAGTACTGTACGTACA 

AlaAspLeuGluValValXhrSerThrXrpValLeuValGlyGlyVall^uAlaAlal^u 
3601 CGGCCGACCXGGAGGXCGXCACGAGCACCXGGGXGCXCGXXGGCGGCGXCCXGGCXGCXX 
:^ GCCGGCTGGACCXCCAGCAGXGCXCGXGGACCCACGAGCAACCGCCGCAGGACCGAC^ 

AlaAlaTyrCysLeuSerXhrGlyCysValVallleValGlyArgValValLeuSerGly 
3661 TGGCCGCGTAXXGCCXGXCAACAGGCXGCGXGGTCAXAGXGGGCAGGGXGGTCXXGTCCG 
ACCGGCGCAXAACGGACAGXIGXCCGACGCACCAGXAXCACCCGXCCCAGCAGAACAGGC 

Ly sProAlallelleProAspArgGluValLeuXyrArgGluPheAspGluMetGluGlu 
3721 GGAAGCCGGCAAXCAXACCXGACAGGGAAGTCCTCXACCGAGAGTXCGAXGAGAXGGAAG 
CCTTCGGCCGXTAGXAXGGACXGXCCCXXCAGGAGAXGGCTCTCAAGCXACXCTACCXXC 

CysSerGlnHisI^uProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLvsGln 
3781 AGTGCXCXCAGCACXXACCGXACAXCGAGCAAGGGAXGATGCTCGCCGAGCAGTTCAAGC 
XCACGAGAGTCGXGAAXGGCATGXAGCXCGXTCCCXACTACGAGCGGCXCGXCAAGXXCG 

LysAlaLeuGlyLeuLeuGlnXhrAlaSerArgGlnAlaGluVallleAlaProAlaVal 
3841 AGAAGGCCCXCGGCCXCCXGCAGACCGCGXCCCGXCAGGCAGAGGXTAXCGCCCCXGCTG 
XCXTCCGGGAGCCGGAGGACGXCXGGCGCAGGGCAGXCCGTCXCCAAXAGCGGGGACGAC 

GlnXhrAsnXrpGlnLysLeuGluThrPheXrpAlaLysHisMetXrpAsnPhelleSer 
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TCCAGACCAACTGGCVlAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCA 
AGGTCTGGTIGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGT 

GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeu 
GTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCAT 
CACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTA 

MetAlaPheThrAlaAlaValTlirSerProLeuThrThrSerGlnThrLeuLeuPheAsn 
TGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCA 
ACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGT 

IleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
ACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTCCCGCTACTGCCTTTG 
TGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAAC 

GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsp 
TGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAG 
ACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATC 

IleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSer 
ACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTCTGGCATTCAAGATCATGA 
TGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACT 

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlalleieuSerProGly 
GCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCG 
CGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGC 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
GAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCG 
CTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCC^ 

GlyAlaValGlnXrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
AGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTT 
TCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAA 

ProThrHisTyrValProGluSerAspAlaAlaAlaArgValThrAlalleLeuSerSer 
CCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCA 
GGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGT 

LeuThrVcaThrGlnLeuLeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThr 
GCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCA 
CGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATG^ 

ProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeuSerAsp 
CTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCG 
GAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGC 

PheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlylleProPheValSer 
ACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTCCCTGGGATCCCCTTTGTGT 
TGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACA 

CysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMetHisThrArgCysHis 
CCTGCCAGCGCGGGTATAAGGGGGTCTCGCGAGTCGACGGCATCATGCACACTCGCTGCC 
GGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGG 

CysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArglleValGlyProArg 
ACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTA 
TGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGC^^ 

ThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGlyPrc^ 
GGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCT 
CCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGOXJGTGCCCGGGGA 

ThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyr 
GTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAAT 
CATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTA 
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ValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeu 
4921 ATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATC 
TACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAG 

LysCysProCysGlnValProSerProGluPhePheThrGlxiLeuAspGlyValArgLeu 
4981 TCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCC 
AGTTTACGGGCACGGTCCAGGGTAGCGGGCTOilAAAAAGTGTCTTAACCTGCCCCACGCGG 

HisArgPheAlaProProCysLysProLeijIieiiArgGluGluValSerPheArgValGly 
5041 TACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAG 
ATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATC 

LeuHisGluTyrProValGlySerGlnLeuProCysGluProGluProAspVcQJU.aVaJ, 
5101 GACTCCACGAATACCCGGTAGGGTCGCAATTACGTTGCGAGCCCGAACCGGACGTGGCCG 
CTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGC 

I^uThrSerMetLeuThrAspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeu 
5161 TGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGT 
ACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCA 

C AlaArgGlySerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSer 
X ' 5221 TGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCAT 

ACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTA 

LeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsn 
5281 CTCTCAAGGCAACTa?GCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCA 
GAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGT 

LeiiLeuTrpArgGlnGluMetGlyGlyAsnlleThrArgValGluSerGluAsnLysVal 
5341 ACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAG 
TGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTC 

VallleLeuAspSerPheAspProLeuValAlaGluGluAspGlxiArgGluIleSerVal 
5401 TGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCG 
ACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGC 

ProAlaGluIleLeuAi^LysSerArgArgPheAlaGliiAlaLexiProValTrpAlaArg 
5461 TACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGC 
AXGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCG ' 

ProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTyrGliiProProVal 

C 5521 GGCCGGACTA1!AACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTG 
CCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGAC 

ValHisGlyCysProLeuProProProLysSerProProValProProProArgLysLys 
5581 TGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGA 
^ ACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCT 

ArgThrValValLeuThrGliiSerThrLeuSerThrAlalieuAlaGliiLeuAl^^ 
5641 AGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCA 
TCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCOXZGAGCGGTGGT 

SerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrXhrThrSerSerGlu 
5701 GAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCT^ 

CTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGAC 

ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
5761 * AGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCC 
TCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGG 

LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer 
5821 CCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTA 
GGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCAT 

GluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThrGlyAlalieu 
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5881 GTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCAC 
CACTCCGGTTGCGCCTCCTACAGCACACGACGAGtTACAGAATGAGAACCTGTCCGCGTG 

ValThrProCysAlaAlaGluGluGlnLysLeuProlieAsnAlaLeuSerAsnSerLeu 
5941 TCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGT 
AGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCA 

LeuArgHisHisAsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys 
6001 TGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGOVGTGCTTGCCAAAQ 

ACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCT 

LysValThrPheAspArgLeuGlnValLeuAspSerHisTyrGlaAspValLeuLysGlu 
6061 AGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGG 
TCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCC 

ValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGluAlaCysSer 
6121 AGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCA 
TCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGT 

LeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAspValArgCys 
6181 GCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCC^ 

CGGACTGCGGGGGTGTGAGOXIGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAA 

HisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeuGluAspAsn 
6241 GCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGOXSTGGAAAGACCTTCTGGAAGACA 
CGGTACGGXCTTTCCGGCATOXSGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGT 

ValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysValGlnProGlu 
6301 ATGTAACACCAATAG ACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTG 
TACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGAC 

LysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyValArgValCys 
6361 AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGAXCTGGGCGTGCGCGTGT 
TCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCAC^ 

GlxiIiysMetAlaLeuTyrAspValValThrLysLeuProLeu2U.aValMetGlySerS^ 
6421 GCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCX 
CGCTXTTCIACCGAAACATGCTGCACCAATGXTTCGAGGGGAACCGGCACTACCCTTCGA 

TyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAlaTrpLysSer 
64 8 1 CCXACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGT 
GGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCA 

LysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThrVcLlThrGlu 
6541 CCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGAC^ 

GGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGXGAC 

SerAspIleArgThrGluGluAlalleTyrGlnCysCysAspLeuAspProGlnAlaArg 
6601 AGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCC 
TCTCGCTGTAGGCATGCCTCCTCCGTTAGATCGTTACAACACTGGAGCTGGGGGTTCGGG 

ValAlalleLysSerLeuThrGluArgLeuTyrValGlyGlyProLeuThrAsnSerArg 
6661 GCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATCTTGGGGGCCCTCXTACCAAOT 

CGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTT 

GlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeuThrThrSerCysGly 
6721 GGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTG 
CCCCCClCTTGACGCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGA'rCGACAC 

AsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGlnAsp 
6781 GTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGG 
CATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCC 

CysThrMetLeuValCysGlyAspAspLeuValVallleCysGluSerAlaGlyValGln 
6841 . ACTGCACCATGCTCGTGaXSTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCC 
TGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGG 
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GluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProPro 
6901 AGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCC 
TCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGG 

GlyAspProProGlnProGluT3n:AspLeuGluI^uIleThrSerCysSerSerAsnVcLl 
6961 CTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCA!EAA^ 

GACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTAXTGTAGTACGAGGAGGTTG^ 

SerValAlaHisAspGlyAlaGlyLysArgValTyrTyrljeuThrArgAspProTto 
7021 TGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTAC^ 
ACAGTCAGCGGGTGCTGCCGCGACCrrTCTCCCAGATGA!IX^^ 

ProLeuAlaArgAlaAlaTrpGluThrAlaArgHisTteProValAsnSerTrpLeuGl^ 
7081 CCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAG 
GGGGGGAGCGCTCTCGACGCACCCTCOXSTCGXXCTGTGTGAGGTCAGTTAAGGACCGATC 

AsnllelleMetPheAlaProThrl^uTrpAlaArgMetllel^ijMetThrHisPhePhe 
7141 GCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCT 
CGTTGTATTAGTACAAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGA 

SerVall^euIleAlaArgAspGlnLeuGluGlnAlaLexiAspCysGluIleTyrGlyAla 
7201 TTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGG 
AATCGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCa!AACGCTC 

CysTyrSerlleGluProLeuAspLexiProProIlelleGlnArglieu 
7261 CCTGCTACTCCATAGAACCACTTGATCTACCTCCAATCATTCAAAGACTC 
GGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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